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Preface 


Physics-based animation is becoming increasingly popular due to its use in computer games, for producing 
special effects in movies, and as paid of surgical simulation systems. Over the past decade the field has 
matured, and today there is a wealth of simulation methods solving many simulation problems. There is a 
vast amount of examples where physics-based animation is used, e.g., rigid bodies stumbling around (The 
Hulk®, Grand Turismo®, Medal of Honor®, Half-Life®); skin and muscle deformations (Shrek®, the 
Nutty Professor®, Jurassic Park®, the Mummy®); water splashing (Shrek®, Titanic®, Finding Nemo®); 
jelly blobs dancing around (Flopper®); death-like animations (Hitman®); hair blowing in the wind or 
bending due to motion of a character (Monsters Inc®); cloth moving (Monsters Inc®); and melting robots 
and cyborg parts of characters (Terminator3®, Treasure Island®), just to mention a few. 

There is an ongoing quest for exploiting physics-based animation. While it was a computationally 
heavy burden 10-20 years ago to kinetically animate a linked character consisting of no more than a hand¬ 
ful of limbs [Wellman, 1993], today this is considered a trivial task due to the large increase of computer 
power. The increase of computer power allows us to simulate increasingly complex scenarios, in an appar¬ 
ently never-ending spiral, and it appeal's that there will always be a demand for faster methods,with more 
details, larger scenes, etc. Nevertheless, a major part of such animations will always be a physical model 
of the world constructed from the laws of physics and solved numerically. A major part of the physical 
models and their numerical solutions were developed prior to the twentieth century. Clever algorithms for 
solving geometrically-complex systems are less than 40 years old and still part of a very active research 
held. 

The widespread use of physics-based animation, especially in the entertainment industry, has resulted 
in a strong demand for education in these topics, both from students and industry. Quite a few books on 
computer animation such as [Watt et al., 1992, Parent, 2001] and computer games such as [Eberly, 2000, 
Eberly, 2003b] introduce the reader to physics-based animation, but these books often refrain from cov¬ 
ering advanced topics, either because they only consider real-time computations, or because of space 
limitations. The details of the simulation methods used in big commercial productions are usually not 
accessible by the general public, since they are kept as business secrets. Most of the advanced litera¬ 
ture on these topics is written as scientific papers addressing specific problems e.g., [Baraff et al., 2003a, 
Osher et al., 2003, Bergen, 2003a], which requires a broad basic education in physics, mathematics, and 
computer science to read and understand. Finally most “tricks of the trade” are not published in scientific 
papers, but learned through experience and word of mouth in the graphics community. 

These are the reasons that motivated us to write an advanced textbook on physics-based animation 
with a focus on the theory. We wanted to create an advanced textbook for computer science graduate 
students, teaching them the basis while at the same time teaching them the thin red line that separates the 
different simulation methods. Our pedagogical goal is to prepare readers for a commercial or academic 
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PREFACE 


career on the subject of physics-based animation, rather than to produce yet another code-hacker for the 
gaming industry. 

Almost all of the methods described in this book have been tested as implementation in the open source 
project OpenTissue. Major parts of this book and the OpenTissue project have been used in our classes 
during the last two years, from which we have gotten much feedback, and for which we arc very grateful. 
Special thanks should be given to Andre Tischer Poulsen, Katrine Hommelhoff Jensen, Anders Holbpll, 
Micky Kelager Christensen, and Anders Fleron who have served as our student reading committee. Andre 
Tischer has also given invaluable help with improving the visual appeal of many of the figures. We would 
also like to thank our ever-patient publisher Jenifer Niles from Charles River Media, together with her 
anonymous reviewing panel and technical writers. Finally, we wish to thank our respective families, 
without whose support, we would not have been able to complete this project. 

Kenny Erleben, Jon Spoiling, Knud Henriksen, and Henrik Dohlmann. 

30th September 2005 
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Introduction 


A long-term goal in computer graphics is to increase realism and believability in computer generated 
animations and pictures. Some original work includes [Armstrong et al., 1985, Terzopoulos et ah, 1987, 
Moore et ah, 1988, Hahn, 1988, Barzel et al., 1988, Platt et ah, 1988, Baraff, 1989, Barsky et ah, 1991]. 
The general belief is that as we get better and better at rendering images, the lack of physical realism 
and believability will be more obvious and therefore more annoying to the common observer. The main 
argument for achieving the goal of more realism has been to use physics to model the behavior and 
movement of computer models. Today these efforts have culminated in what is usually referred to as 
physics-based animation or modeling. 

Physics-based animation is a liighly-i n tc id i sc i p I in ary field, which is based on theories from engi¬ 
neering [Zienkiewicz et al., 2000, Stewart et al., 1996, Pfeiffer et al., 1996b, Enright et ah, 2002], from 
physics [ Baraff. 2001], and from mathematics. Some of the most noteworthy simulation models arc based 
on robotics [Craig, 1986, Featherstone, 1998] and solid mechanics [House et ah, 2000, Teran et ah, 2003]. 
To our knowledge, the field of physics-based animation was first named in a course in the 1987 ACM SIG- 
GRAPH (the Association for Computing Machinery’s Special Interest Group on Computer Graphics) con¬ 
ference, “Topics in Physically-Based Modeling” organized by Alan H. Barr. In recent years the emphasis 
in physics-based animation on computationally efficient algorithms has spawned a new field: plausible 
simulation [Barzel et ah, 1996]. 

In a movie production pipeline, it is generally believed that using physics inhibits the creativity and 
aesthetics of an animator. The reason is quite obvious: it is hard to be true toward a physical model, while 
at the same time using every aspect of the Principles of Animation [Lassiter, 1987, Frank et ah, 1995]. 
While some of the principles focus on implementing the physical reality such as squash and stretch, 
timing and motion, and slow in and out, most of the principles arc about exaggerating motion to entertain 
an audience and keeping the audience’s attention. Hence, animators have a preference for surrealistic 
movement in lieu of entertainment, and they have no shame in putting their characters in unnatural poses. 
They often apply a lazy approach to render only what is visible, as long as the motion and the scene 
conveys an interesting story. 

This book is not concerned with the Principles of Animation, nor does it attempt to qualify or quantify 
believability or plausibility of the presented methods and algorithms. It is not concerned with the ren¬ 
dering process or animation systems. Instead, this book is first devoted to giving the reader a firm and 
solid foundation for understanding how the mathematical models arc derived from physical and mathe¬ 
matical principles, and second, how the mathematical models arc solved in an efficient, robust, and stable 
manner on a computer. As such, the book introduces the reader to physics, mathematics, and numerics, 
nevertheless this book is not a substitute for text books in these fields. 
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CHAPTER 1. INTRODUCTION 



Data Acquisition 


Program 


Prediction 


Visualization 


Figure 1.1: Schematic overview of physics-based animation and simulation. The white arrows are typical 
of the engineering disciplines, whereas the black arrows are typical extra steps taken in computer graphics. 

1.1 The Computer Graphics Model 

One perspective of physics-based animation is that it consists of many theoretical laws and tools from 
physics and mathematics with some geometry added to it, and these are all mixed together to obtain a 
mathematical model of the real world. Once this model is obtained, it can be remodeled into a numerical 
model, which in turn may be programmed on a computer. The program is then able to make predictions 
about the real world, and these predictions may be used to estimate where objects are expected to move 
(forward kinematics and dynamics), or they may be used to calculate how objects should be handled in 
order to obtain some desired movement (inverse kinematics and dynamics). This view of the world is 
schematized in Figure 1.1 with the white arrows and is typical of the engineering disciplines. In engineer¬ 
ing, the goal is often to make highly accurate predictions about the real world in the long term e.g., to 
ensure the stability of a bridge or to predict the efficiency of a heating system. Using a surgical simulator 
as an example, we will describe typical steps covered by Figure 1.1: the initial point is typically some 
kind of data acquisition and modeling step, where a patient is scanned, and the resulting 3-dimensional 
image is segmented into meaningful anatomical objects and represented as geometrical shapes. These 
objects are then augmented with relevant physical laws often in the form of an energy system. To perform 
a physical simulation, the energy system is analyzed with mathematical tools such as calculus of variation, 
and the system is converted into a set of partial differential equations. The mathematical model is often 
a set of partial differential equations together with a detailed description of geometry acting as boundary 
conditions. To implement the mathematical model on a computer, the equations are often discretized e.g.. 
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using the appropriate finite differences method, which in turn can be programmed in C++ or a similar 
programming language. Finally, the simulator is able to be run, and be used to make predictions about the 
future, i.e., what will happen if certain structures arc changed or removed. 

In spite of the impressive accomplishments of physical, mathematical, and engineering disciplines, it 
is often the case that neither the mathematical model nor the numerical techniques are useful for computer 
animation. Computer animation favors high visual detail, high performance, and low memory consump¬ 
tion. In contrast, the engineering approach is often not interested in momentary visual details, since its 
primary goal is often to predict long-term behavior and/or high precision. Hence, computation time and 
memory consumption is often high, since if the task is to predict whether a bridge will stand for 100 
years, then it might easily be justified to have tens or hundreds of computers work on the problem over 
several days or weeks. In computer animation however, we would rather be able to visualize the crash of 
a bridge with visual appeal on a laptop computer in real time. Due to these differences between engineer¬ 
ing and computer graphics, computer scientists remodel the mathematical models to favor visual details, 
and choose numerical techniques favoring speed, robustness, and stability over accuracy and convergence. 
These tradeoffs may be called the computer graphics model. To summarize, the goals of physics-based 
animation is to model the physical world, but in contrast to engineering disciplines, the computer graphical 
approach favors visual flair and low computational resources. 

1.2 The Taxonomy of Physics-Based Animation Methods 

At the highest level, the field of physics-based animation and simulation can roughly be subdivided into 
two large groups: 

Kinematics is the study of motion without consideration of mass or forces. 

Dynamics is the study of motion taking mass and forces into consideration. 

But the story does not end here, because kinematics and dynamics come in two flavors or subgroups: 
Inverse is the study of motion knowing the starting and ending points. 

Forward is the study of motion solely given the starting point. 

In the first subgroup, one typically knows where to go, but needs to figure out how to do it. As an example, 
one might know the end position of a tool frame of a robot manipulator, without knowing what forces and 
torques to apply at the actuators in order to get to the final destination. In other words, inverse kinematics 
and dynamics computes the motion “backwards”. Forward kinematics and dynamics work exactly the 
opposite. Using the same example as before, one typically knows the starting position of the tool frame 
and the forces and torques acting through the actuators. The goal is then to predict the final destination. 
The difference between inverse and forward kinematics/dynamics is illustrated in Figure 1.2. 

There arc numerous techniques and methods in physics-based animation and simulation, and you 
probably already have heard several buzzwords. In Table 1.1 we have tried to classify some of the most 
popular techniques and methods according to the four subgroups introduced above. 
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Time- 


Figure 1.2: Example showing the difference in forward and inverse kinematics and dynamics. In the case 
of inverse motion, the starting and ending positions of the robot gripping tool is given, whereas in the case 
of forward motion, only the starting position is given. 



Inverse 

Forward 

Kinematics 

• Cyclic Coordinate Descent 

• Jacobian Method 

• Spline Driven Animation 

• Key Frame Animation (Interpolation) 

• Closed-Form Solutions 

• Free-Form Deformation 

Dynamics 

• Recursive Newton Euler 
Method 

• Optimization Problems 

• Featherstone’s Method (The Articulated- 
Body Method) 

• Composite-Rigid-Body Method 

• Particle Systems, Mass-Spring Systems 

• Finite Element Method 

• Constraint-Based Simulation 


Table 1.1: Common methods and their classification into inverse/forward kinematics/dynamics. 
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1.3 Scientific Computing versus Computer Graphics in Practice 

The engineering methods are dominated by the field of scientific computing. Looking at some of the 
applications, it becomes clear that the diversity is large. Examples of current simulators for real projects 
follow. Since the authors arc Danish, most of the examples arc Danish, however, we strongly suspect that 
the range of problems and their computational demands arc international. 

Numerical wind tunnels at RISOE: 

The numerical wind tunnel at RISOE (www. risoe . dk/vea-aed/numwind) uses a hyperbolic 
grid generator for both two-dimensional and three-dimensional domains. Navier-Stokes is solved 
using EllipSys2D/3D code, which is an incompressible finite volume code. The computation of a 
stationary windmill rotor takes roughly 50 CPU hours. Using 14 CPUs (3.2GHz Pentium M, 2GB 
RAM) in a cluster the simulation takes about four hours. Nonstationary computations take three to 
four times longer [Johansen, 2005]. 

Deformation of atomic-scale materials: 

In atomic-scale materials physics simulations of plastic deformation in nano crystalline copper arc 
done on a cluster (Niflheim, 2.1 Teraflops) of everyday PCs. A simulation involves 100 nodes 
running in parallel and often takes weeks to complete (www.dcsc.dk). In quantum chemistry, 
problems often involve 10 9 variables and simulations can take up to 10 days or more to complete 
[Rettrup, 2005]. 

Computational fluid mechanics 

Fluid mechanics involves a wide range of flow problems. Three dimensional nonstationary flows 

typically require 10 -100 x 10 6 grid nodes and use up to 1000 CPU hours per simulation (www. 

dcsc . dk). Solid mechanics are also simulated (www. dcsc . sdu . dk) but often not visualized 
[Vinter, 2005], 

Computational astrophysics: 

Computational astrophysics (www. nbi . ku . dk/side22 73 0 . htm) involves simulating the for¬ 
mation of Galaxy, Star; and Planet. Smoothed Particle Hydrodynamics and Adaptive Mesh Re¬ 
finement arc some of the methods used. Computations often take weeks or months to complete 
[Nordlund, 2005]. 

Weather reports: 

Weather reports arc simulated continuously and saved to disk regularly at DMI (www. dmi . dk). 
Information is saved every three hours. Forty-eight-hour reports arc computed on 2.5 grid nodes 
using time-step size of two minutes. The total number of computations arc in the order of 10 12 and 
solved on very large supercomputers [Sprensen, 2005]. 

From this short survey of scientific computing it is evident that the simulations rely on large supercomput¬ 
ers, often in clusters. The amount of data is astronomical and computation times that cover a wide range 
from minutes to hours, from hours to days, weeks, and even months arc not unheard of. Visualization 
ranges from simple arrows illustration flow fields to quite advanced scientific visualization. 
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A noteworthy point is that the concept of real time is often not a very useful term in scientific comput¬ 
ing. For instance, in chemistry, simulated time is the order of pico seconds but the computation takes days 
to complete. The main idea of the simulations is often to see a chemical process in slow motion in order 
to observe what is happening. This is not doable in a laboratory. In contrast, astro and sea simulations arc 
too slow and arc therefore simulated at higher rates. In sea-flow simulations a couple of hundred years 
arc simulated per day. In conclusion, turn-around time in scientific computing is often of the order of 24 
hours to 30 days [Vinter, 2005]. 

Looking at physics-based animation from a computer graphical perspective gives a slightly different 
picture. 

Fluid phenomena: 

Smoke simulation for large-scale phenomena [Rasmussen et al., 2003] using half a million parti¬ 
cles takes 2-10 secs, of simulation time per frame, and rendering time takes 5-10 minutes per 
frame (2.2 GHz Pentium 4). In [McNamara et al., 2003], key frame control of smoke simulations 
takes between 2-24 hours to perform on a 2 GHz Pentium 4. Target-driven smoke simulation 
[Fattal et al., 2004] on a 2.4 GHz Pentium 4 in 2D on a 256 2 grid takes 50 secs, for a 1 second 
animation in 3D on a 128 3 grid it takes 35 minutes. 

Particle systems: 

In [Feldman et al., 2003] suspended particle explosions are animated. The simulation time ranges 
from 4-6 secs, per frame on a 3GHz Pentium 4. With render times included, a simulated second is 
in order of minutes. Animation of viscoelastic fluids [Goktekin et al., 2004] using a 40 3 grid runs 
at half an hour per second of animation on a 3 GHz Pentium 4. Animation of two-way interaction 
between rigid objects and fluid [Carlson et al., 2004] using a 64 x 68 x 84 grid on a 3GHz Pentium 
4 with 1 GB RAM takes 401 simulation steps for one second of animation with average CPU time 
per simulation step of 27.5 secs. 

Cloth animation: 

Robust cloth simulation without penetration [Bridson et al., 2002] of a 150 x 150 nodes piece of 
cloth runs at two minutes per frame on a 1.2 GHz Pentium3. Untangling cloth [Baraff et al., 2003b] 
for a cloth model with 14 K vertices yields additional cost of 0.5 secs simulation time per frame 
on a 2 GHz Pentium 4. Changing mesh topology [Molino et al., 2004] during simulation for 1 K 
triangles runs at 5-20 minutes per frame, 380 K tetrahedra runs at 20 minutes per frame. Stacked 
rigid body simulation [Guendelman et al., 2003] with simulations of 500-1000 objects takes 3-7 
minutes per frame on average. 

Computer Games 

Game consoles such as Play Station 2 (www. playstation . com) has only 32 MB RAM and 6.2 
GFLOPS and common home PCs are for the most part inferior to the computers used by computer 
graphics researchers. This has called for some creative thinking in computer game development to 
reach the performance requirements. Furthermore, there must be time set aside for other tasks in 
such applications as computer games. For instance, in Hitman from IO-Interactive, only 5-10% of 
the frame time is used for physics simulation [Jakobsen, 2005]. The future may change these hard¬ 
ware limitations. For instance, recent GPUs do not suffer from latency problems and promise 60 
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GFLOPS (www. nvidia . com), also PPUs (www. ageia . com) seems to be an emerging tech¬ 
nology. Finally cell chips (www. ibm. com/news/us/en/2005/02/2005_02_08 .html) 
are also promising. 

From this short literature survey of recent work on physics-based animation in computer graphics, it is 
evident that frame times ranging from the order of seconds to hours running on single PCs arc used. 
In conclusion, design-work both in computer graphics and scientific computing requires reasonably low 
turn-around times. 


1.4 Future Points of Study 

Physics-based animation is a large field, and it is growing rapidly. Every year new techniques arc presented 
for animating new phenomena, and existing methods arc improved both in speed, accuracy, and visual 
detail. Covering everything is not possible in one book, needless to say that keeping up-to-date is an 
important job of any physics-based animator. A good place to start for industry standards arc the Physics 
Engines, many of which arc shown in Table 1.2. 

In the following we will briefly mention some topics we find interesting, but will not cover in this 
book due to space limitations and for pedagogical reasons: 

Recursive methods for jointed mechanics. 

Recursive methods for jointed mechanics arc also called minimal coordinate methods , and the theory 
is extensively covered in the literature, see e.g., [Featherstone, 1998]. 

Mathematical programming. 

Mathematical programming is the study of, among other things, complementarity problems. The 
field has a history going back to the 1940s. It is a huge field and would justify several textbooks in 
itself. In this book we recommend using an existing library such as [Path, 2005] during a course or 
confer with e.g., [Cottle et al., 1992, Murty, 1988], 

Deformable objects with nonlinear anisotropic material properties. 

Nonlinear and large deformations arc still very much a problem of ongoing research and not some¬ 
thing we feel fits into an introductory textbook. 

Cutting and fracture of deformable objects. 

Recently there has been published work on fracture, but both cutting and fracture is just on the brim 
for today’s real-time applications. 

Particle level set methods and adaptive data structures for water simulation. 

We have chosen to focus on the more classical approach, giving the reader a good understanding 
of the basics. It should not be difficult to proceed with the recent literature after this, see e.g., 
[Osher et al., 2003], 

Control in physical-based animations. 

Recently, attempts have been made to include animator control by using inverse dynamics ap- 
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Name 

Notes 

DynaMechs 

Development started in 1991 by S. McMillan, 

(dynamechs.sourceforge.net) 

Renderware Physics 

Business unit of Criterion Software established in 1993, 

(www.renderware.com/physics.asp) 

Havok 

Founded in 1998, (www. havok . com) 

Meqon 

Started as a university project in 1999, company founded 
in 2002 by D. Gustafsson and M. Lysen, (www. meqon. 
com) 

Ipion 

Bought by Havok in June 2000, (www. ipion . com) 

Open Dynamics Engine 

Seems to have been around somewhere between 2000- 
2001, started by R. Smith former MathEngine employee, 

(www. ode . org) 

Novodex 

Started spring 2002 by A. Moravansky and M. Muller, 

(www. novodex. com) 

Tokamak 

D. Lam is the original author of Tokamak, appeal's to have 
started in 2003, (www. tokamakphysics . com) 

Newton Game Dynamics 

Appeal' to have been around since 2003, (www. 

physicsengine.com/) 

Karma 

Developed by MathEngine, which was acquired by Crite¬ 
rion Software in July 2003, (www. mathengine . com) 

Vortex 

Developed by CMLabs, (www. cm- labs . com) 

Free Cloth 

Appeal's to have stalled around 2002 by D. Pritchard, 

(sourcef orge . net/pro jects/freedoth) 

OpenTissue 

Stalled November 2001 by K. Erleben, H. Dohlmann, 

J. Spoiling and K. Henriksen, (www. opentissue. 
org) 

AGEIA 

Company Founded 2002, March 2005 AGEIA announces 
a new class of physics processing unit (PPU) PhysX, 

(www. ageia . com) 


Table 1.2: A list of commercially and publicly available physics engines. 
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proaches. We have omitted all theory and methods regarding how to control and manipulate an 
animation, since we do not regal'd these as part of a low-level physics engine. 

Continuous collision detection. 

Tunneling effects and overshooting are some tedious problems in animation. One remedy for these 
problems is continuous collision detection, which promises better contact point generation and de¬ 
termination of contact areas in dynamic simulations. The field is rather new but very promising. 
However, in our opinion there is not yet a firmly established standard, and we have therefore omit¬ 
ted this material. Interested readers may refer to [Redon et al., 2002, Redon, 2004a, Redon, 2004b, 
Redon et al., 2004a], 

Simplex methods for collision detection. 

Simplex-based methods for collision detection are based on the mathematical concept of an affine 
lineal' independent set of variables. It is our experience that these methods are difficult to approach 
for students and not essential, which is the main reason why we have omitted these methods. Good 
references are [Bergen, 2003b, Bergen, 2001, Bergen, 1999]. 


1.5 Readers Guide 

Having taught advanced computer graphics and animation for several years we found that we lacked a 
thorough introduction to physics-based animation. Therefore we started writing notes first on the mathe¬ 
matical tools needed for the simulators, and then notes describing the various animation methods. Hence, 
the primary recipients of this book are graduate students in a computer science department. The back¬ 
ground of our students is typically well-versed in the art of programming, introductory computer graphics, 
and what we call mathematical maturity. By mathematical maturity we mean that students don’t necessar¬ 
ily have a rigorous mathematical training, but that they are eager to learn new mathematical techniques. 
Our approach is then practical: we strive to teach the students theories such as presented in this book, and 
we emphasize that they must be able to implement the theories in actual working simulators. We hope the 
book reflects these goals, e.g., by the numerous pseudocode examples. Almost all algorithms described 
have been implemented in the accompanying open source project [OpenTissue, 2005]. 

The book contains five parts reflecting the diversity of the field of physics-based animation. There are 
therefore several ways to read this book: 

Geometry of motion: 

For the reader interested in the geometry of motion and kinematics, and those who wish to have 
total control of motion patterns, we suggest Part I, which describes the design of motion patterns 
regardless of the physics of the real world. 

From particles to rigid bodies: 

The most common starting point of physics-based animation is to begin with particle systems (Chap¬ 
ter 8), followed by a study of rigid body animation in Part II, then an advanced investigation into 
deformable objects in Part III, and ending with a study of Computational Fluid Dynamics in Chap¬ 
ter 11. 
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Computational fluid dynamics: 

Some readers will find that Computational Fluid Dynamics is simple and choose to read Chapter 8, 
followed by Computational Fluid Dynamics in Chapter 11, and Continuum Models with Finite 
Differences in Chapter 9. It is our experience that finite element analysis in Chapter 10 poses the 
biggest challenges for novices. 

Collision detection: 

Collision detection covered in Paid IV is often treated in parallel with any of the above subjects. 

Mathematical and physical compendium: 

Paid V is written as a very extended appendix, which contains much of the physics and mathematics 
needed for understanding the theories in this book. 


Flappy reading! 
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Kinematics is the study of the motion of parts without considering mass or forces. Newton’s laws, 
therefore, arc neglectable. The main application of kinematical methods is preconligured animation, where 
the physical simulation is not required, too complex to bother with, or does not live up to the Principles of 
Animation [Frank et al., 1995, Lassiter, 1987], Many computer films rely heavily on kinematics for mo¬ 
tion; much of the theory may thus be seen as a natural extension to the stop-motion techniques developed 
by Disney in the early twentieth century. Kinematics arc divided into forward kinematics and inverse 
kinematics, depending on whether the end configuration is given or not. 

Forward kinematics is characterized by the motion being prescribed by the user or some function. 
Therefore, it is often also called direct kinematics. Forward kinematics requires much manual labor; nev¬ 
ertheless, direct kinematics is widely used in movie production. Here animators specify every parameter 
as a function of time. Often the animators act out the motion of the figures they arc about to animate. To 
reduce production time, a computer vision technique called motion capture is often applied. 

Motion capture is the process of tracking annotated points on an actor via video, reconstructing the 
points location in three dimensions, and using the reconstruction as input to direct kinematics. With this 
tool, animators can create astonishing realistic-looking motion that obeys the laws of physics. Motion 
capture is a great tool for movie production, where the reconstructed motion is used to specify the param¬ 
eters for a specific motion. Motion capture may also be used as a tool for creating new motion sequences 
for computer games. In this case, the motion capture data is seen as a discrete sample of physical motion 
and the new motion sequence is obtained by fitting a continuous model to the discrete data. One such 
technique is called motion blending or animation blending. Blending techniques arc often found in game 
engines such as those listed at www.devmaster.net/. There arc three basic problems when doing 
motion blending: 

Motion Time-Warping: How to match to motion sequences so they arc aligned in time. In other words, 
one has to figure out what frame in one motion corresponds to what frame in another motion. 

Motion Alignment: One motion may be heading left and the other right, so what position should the 
blending motion have? 

Motion Constraint Matching: If we want to blend a running and walking motion, then we should care¬ 
fully match frames where the corresponding feet touch the ground, in order not to make a sliding or 
floating motion result. 

Motion blending is used in computer games to create motion transitions between motion-captured data 
and is seen in many third-person shooter games such as Hitman® etc. The techniques arc often ad hoc and 
rely heavily on animators to completely specify motion transitions a priori. Recent research focuses on 
making the motions blend more naturally by taking the balance of a figure into account. 

Parameters for forward kinematics may also be generated using motion programs [Barsky et al., 1991], 
In motion programs, joint parameters arc given as some continuous function of time, such as cos (t), which 
generates an oscillating motion. Motion functions have to be carefully designed for the specific motion 
such as biped gait walking [Parent, 2001, Chapter 6.2] or fish swimming [Terzopoulos et al., 1994]. 

Inverse kinematics deals with the interpolation of a figure from a specified stalling and ending point. 
This is also often used in animation tools such as Maya®or 3DMax®. Even for simple objects such 
as the human arm. there will almost always be several solutions, since the elbow is movable even when 
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the hand and the shoulder arc fixed. Hence, a central problem in inverse kinematics is to select one out 
of several possible motions. A solution is often sought that generates a natural looking motion. This is 
achieved either by augmenting the inverse kinematics with extra constraints or combining with dynamics; 
for instance, using minimum energy principles. 

Finally, forward and inverse kinematics arc often combined in real applications. For example, direct 
kinematics may be used to give natural looking motions for some objects and inverse kinematics for 
others. Another example is to use motion capture to generate statistics on realistic motions and use this 
information to select solutions for inverse kinematical problems. Many physics engines further support 
the mixture of forward and inverse kinematics with physically simulated objects. A typical example in 
a movie production is to use inverse kinematics or physics-based animation to determine a rough set of 
parameters for a given motion. Then the animation is computed in detail, where parameters for all objects 
in each frame arc recorded. This is called baking [Kacic-Alesic et ah, 2003]. After having baked the 
animated objects, an animator can fine-tune the motions using the direct kinematics method to produce 
entertaining animations. In conclusion, in real applications, techniques arc used interchangeably both in 
combination, as pre- and postprocessing steps. 

In Part I we describe the basics of forward and inverse kinematics of articulated figures in Chapters 2 
and 3, and the subject of scripted motion in Chapter 4. 
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This chapter describes articulated figures and their representation. This representation is used later for 
animation and simulation of articulated figures. An articulated figure can be thought of as a robot arm or 
a human arm made of a number of solid rods, which arc connected to each other by joints that can move 
independently. When all the joints move, the overall motion of an articulated figure can be very complex. 
The goal of this chapter is to present methods that make it easy to describe the kinematics of articulated 
figures, which will be used later for animation and simulation of articulated figures. 

The chapter is divided into three sections. Section 2.1 defines precisely what is meant by an articulated 
figure, and how it is constructed. The last two sections explain two different methods of describing 
articulated figures. Section 2.2 presents a general method named Paired Joint Coordinates, and Section 2.3 
presents a specialized method named Denavit-Hartenberg. 

2.1 Links and Joints 

An articulated figure is a construction made of links and joints. The different links arc connected by joints, 
which have some degree of freedom. A link can be thought of as a solid rod, which cannot change its shape 
nor length. Hence, a link is considered a rigid body that defines the relation between two neighboring joint 
axes, see Figure 2.1. A joint can be thought of as a connection between two neighboring links. A joint 
might have several degrees of freedom, i.e., it might rotate around one, two, or three axes, or it might 
translate along one, two, or three axes. 

An example of an articulated figure with one revolute joint is shown in Figure 2.1. One can think of 
the links and joints as the building blocks which make up an articulated figure. Examples of articulated 
figures arc: industrial robots, construction equipment, and the human skeleton. 

Links and joints arc numbered from 0,..., N, and an articulated figure always starts with joint q, 
which is fixed at some stationary base coordinate system. The numbering of the links and joints is very 
important. A joint inside an articulated figure, say joint , connects linki-i and linki, where linki- \ is 
closer to the base. 

The joint shown in Figure 2.1 is a revolute joint, i.e., it can rotate around one axis. Generally, joints 
arc named for what they can do: 

revolute joint: A joint that can rotate around one or more axes. Figure 2.2(a) shows a revolute joint with 
one degree of freedom, but in general a revolute joint can have up to three degrees of freedom, i.e., 
it can rotate around all three coordinate axes. 

prismatic joint : A joint that can translate along one or more axes. Figure 2.2(b) shows a prismatic joint 
with one degree of freedom, but in general a prismatic joint can have up to three degrees of freedom, 
i.e., it can translate along all three coordinate axes. 
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linki joint i linki-i 



joint axes 


joints 


base 


joint i+1 


End Effector 


Figure 2.1: An example of an articulated figure with one revolute joint. The revolute joint can rotate 
around one axis. The numbering of the joints is very important, i.e., joint; connects 1 ink, | and link, . 



jointi joint axis 



linki-i linkj. 

(b) prismatic joint (translation) 


Figure 2.2: Examples of different joints used to construct articulated figures. 
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Figure 2.3: The three coordinate systems associated with each linki'. BF,. IF,, and OF,. The origins and 
coordinate axes of the inner-frame IF „ and the outer-frame OF, are specified in the body-frame BF 

2.2 Paired Joint Coordinates 

As mentioned previously, an articulated figure is constructed of links and joints. Because each joint can 
either rotate or translate, the motion of the individual links and joints can be very complex. This is because 
the motion of joint j and linkj affect the motion of joint t and link., for i > j. An example of this is: if 
the base joint rotates or translates its motion affects all other joints and links of the articulated figure. 
Therefore, it is very complicated to describe the motion of an articulated figure in a coordinate system 
common to all joints and links. This difficulty can be overcome by introducing local coordinate systems 
for all joints and links, and establishing transformations between the local coordinate systems. 

An articulated figure can be described using the paired joint coordinates method [Featherstone, 1998]. 
The method is very general and it is based on associating three predefined coordinate systems with each 
link. For link, these coordinate systems are named: body frame {BF i), inner frame ( IF f), and outer- 
frame {OF,), and they are all associated with link,. The origins of the inner-frame and outer-frame are 
located on the respective joint axes. This makes it easy to transform entities between successive links, as 
we will show later. These coordinate systems are illustrated in Figure 2.3 and explained below. 

The Body Frame (BF f) : is a local coordinate system that is associated with linki. The geometry of linki 
is described in this local coordinate system. Generally, the origin and the axes of this coordinate 
system can be chosen arbitrarily, but in order to handle articulated figures easily, it is recommended 
that you choose the origin of BF, to be the center of mass of link,. Also, it is recommended that you 
choose an orthogonal coordinate system. If link, has some symmetry axes, it is recommended that 
you choose some of them as basis axes for the local orthogonal coordinate system. In the following 
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it is assumed that all coordinate systems arc orthogonal. 

The Inner Frame (IFi): is a local coordinate system that is associated with linki. Usually, this coor¬ 
dinate system is associated with joints It has its origin located on the axis of joints and has one 
axis parallel to the direction of motion of the joint. Both the origin and the coordinate axes of this 
coordinate system arc specified in the body frame. 

The Outer Frame (OF j): is a local coordinate system that is associated with linki. Usually, this coor¬ 
dinate system is associated with joint i+1 . It has its origin located on the axis of joint i+l , and has 
one axis parallel to the direction of motion of the joint. Both the origin and the coordinate axes of 
this coordinate system arc specified in the body frame. 

The rest of this section contains subsections that derive transformations between the different frames 

1. Compute the transformation BFi Tjp i from inner frame to body frame (Section 2.2.1). 

2. Compute the transformation BFi ToFi from outer frame to body frame (Section 2.2.2). 

3. Compute the transformation OFi ~ 1 TiF i from inner frame of linki to outer frame of linki- i- 

4. Compute the transformation which transforms entities from the body frame of linki to the 

body frame of linki- i- 

The notation to Tfrom means that the transformation transforms an entity given in frame FROM 
coordinates to coordinates in the TO frame. 

When all these transformations arc derived, it is possible to transform the coordinates of a point p 
specified in the body frame of link , to coordinates in any other body frame, e.g., the body frame of linkj. 
This is very general. But it makes it easy to transform the coordinates of a point p specified in any body 
frame of linkj to the base coordinate frame, the body frame of link o- This makes it possible to transform 
positions, orientations, velocities, and accelerations of a point p, specified in any body frame to the base 
frame, which can be used as a coordinate system common to all joints and links. 

2.2.1 The Transformation BFi T IF . 

This section describes how to make a mapping from the inner frame to the body frame. To be more 
specific, the problem is: given a point p in inner frame coordinates, determine its coordinates in the body 
frame. 

With the definitions from Section 2.2 for linki, l et °if,, Uf,, j if ■< and kjp, be the origin and the 
basis vectors of the inner frame expressed in the body frame, see Figure 2.4. 

The relation between the inner frame and the body frame is as follows: given a point p = [x,y,z\ T in the 
inner frame IF, its coordinates in the body frame, BF j, can be expressed by an angular rotation tp around 
some axis u followed by a translation r n? L . This transformation is denoted BFi TiF i , where the subscript 
IF i and the superscript BF i indicate that the coordinates of a point p specified in inner frame coordinates 
is transformed to body frame coordinates. 
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Figure 2.4: The mapping between the inner frame IF, and the body frame BFi for linki. The origin and 
coordinate axes of the inner frame IF, arc specified in the body frame BF,. 


In homogeneous coordinates the transformation from the inner frame to the body frame is given by a 
matrix 

BF ‘T IFi =T IFi (r IFi )R IFi (tp IFi ,u IFi ) (2.1) 

The translation and rotation matrices arc given below 


T IFi {r IFi ) 


RlF i (FlF i ,U IFi ) 


1 r IFi 
0 T 1 

ilFi JlFi k IFi 0 
0 0 0 1 


(2.2a) 

(2.2b) 


where the symbol 1 denotes a 3 x 3 identity matrix, and the vectors ij Fi ,j IF .,ki Fi , and 0 arc 3 x 1 
column vectors. This makes both matrices 4x4. Both the translation matrix and the rotation matrix 
arc constant, depending only on the relative position and orientation of the inner frame and body frame. 
Therefore, the resulting transformation BFi Ti Fi is also constant. 


2.2.2 The Transformation BFi T OF . 

In this section it is described how to make a mapping from the inner frame to the body frame. To be more 
specific, the problem is: given a point p in outer frame coordinates, determine its coordinates in the body 
frame. 

Recall the definitions from Section 2.2 for linki, and let oo Fi , ioFi, 3 of f an d koFi be the origin 
and the basis vectors of the outer frame expressed in the body frame, see Figure 2.5. A transformation 
can be specified between the outer frame OF, and the body frame, BF, by defining the origin and the 
basis vectors of the outer frame in the body frame coordinate system. Let this transformation be denoted 
BFl T of i, where the subscript OF, and the superscript BF \ indicate that the coordinates of a point p 
given in outer frame coordinates is transformed to body frame coordinates. 
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Figure 2.5: The mapping between the outer frame OF, and the body frame BF, for link,. The origin and 
coordinate axes of the outer frame OF, arc specified in the body frame BF,. 


In homogeneous coordinates the transformation from the outer frame to the body frame is given by a 
matrix 

BFi T OFi = T OFiirOF^RoFiiVOFi^OFi) (2.3) 

The translation and rotation matrices are given below 


T OF i(r OFi) 


RoFiiFOFi^UoFi) 


1 r OFi 

0 T 1 

i OF i j OFi k OFi 0 

0 0 0 1 


(2.4a) 

(2.4b) 


where 1 is a 3 x 3 identity matrix, and the vectors i of, . j OF ., k of and 0 arc 3 x 1 column vectors, which 
makes both matrices 4x4. Both the translation matrix and the rotation matrix arc constant, depending 
only on the relative position and orientation of the inner frame and body frame. Therefore, the resulting 
transformation BFi T of t is also constant. 


2.2.3 The Transformation OFi - 1 T IF .(d i , f,. u,) 

In previous sections the link transformations Bt l Tj Fi and BFi T of, were derived, see Sections 2.2.1 
and 2.2.2. These transformations arc local to the link in question, namely linki. In this section a transfor¬ 
mation from the inner frame of link, to the outer frame of linki -1 will be derived. Let this transformation 
be denoted OFi ~ 1 Tj F .. 

The links link, - \ and link, arc connected by joint,. The relation between the inner frame of linki 
and the outer frame of linki -1 is given by a joint-transformation consisting of a rotation (■ tpi,Ui ) and a 
translation di 

OF i-iT IFi (di, <Pi,Ui) = Ti(di)Ri(<pi,Ui ) (2.5) 

where f, is the rotation angle, u, is the rotation axis, and di is the translation vector, see Figure 2.6. To 
make the notation easier, the index i is omitted in the following two equations. This means that f , = f. 
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joint axis 



Figure 2.6: Linki-i and link l arc connected by joints and an associated joint-transformation OFi ~iT IFi . 
The figure shows a revolute joint, but the transformation 0Ft - 1 T j F . can be any transformation. 


Ui = u,y. u z ] T , and d, = [d x ,d y ,d z ] T . With this notation the translation matrix T(di) and the 
rotation matrix R(pi,Ui) arc given by 


T(di) 


R(<Pi,Ui ) 


i o o (4 
0 10 d y 
0 0 1 d z 
0 0 0 1 


ul + (! - u\)cp 

U X Uy{ 1 — Clf) + U Z Slf 
U X U Z { 1 — C<£>) — U y Sp 

0 


u z u.y{ 1 — cp) — u z sp 
ul + (1 - u 2 y )cp 
U y U z ( 1 — op) + U x sp 

0 


'U’x'U'ziX ~ Cp) + UySp 

U y U z { 1 — op) — U x sp 
u 2 z + {l- u 2 z )cp 

0 


0 

0 

0 

1 


(2.6a) 


(2.6b) 


where sp is shorthand for sin p, and cp for cos p. The parameters (d, t , p t , Ui) are denoted joint parame¬ 
ters. 
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2.2.4 The Transformation ■* 1 j T, (<i ( , (p t , Ui ) 


From Figure 2.6 it can be seen that the transformation between the body frame of link , and the body frame 
of linki -1 is given by 


= (^- 1 T 0 Fi _ 1 )( 0 ^- 1 T 7 Fi (d i ,^,w i ))( SFi T/ Fi ) 


-l 


(2.7) 


By specifying the transformation 1 ' T, ( d, , p, , ttj) for all links, i = 1,.... A r a transformation from link 
N to the base, linko can be written 


°T n = °Tjv(di, <pi, ui ,..., djv, pn, ujy ) 

= °Ti(di, <^i,'Ui) 1 T 2 (d2, p>2,u-2) ■ ■ ■ n ~ 1 Tn(c1n, <pn,un) (2.8) 

Let 6i denote the generalized joint parameters for joint i, i.e., 0, = (dj, ipi,Ui). Then (2.8) may be written 
in a more compact form 

°T n = °T n (9 i,..., 0 jv) = °T’ 1 (6> 1 ) 1 (T 2 (0 2 ) • • • (^-t) j , Ar (0 JV ) (2 .9) 

This is a very general approach. It is possible to use it, but it is not very easy to use. In the next section it 
is shown how the approach can be specialized, so it is much easier to use. 


2.3 Denavit-Hartenberg 

The Denavit-Hartenberg notation [Craig, 1986] is a very specialized description of an articulated figure. 
It is specialized such that each joint has only one degree of freedom: it can either translate along its 2 -axis 
or rotate around its 2 -axis. If a joint needs more degrees of freedom, several joints with link lengths equal 
to zero are located on top of each other. 

Because each joint has only one degree of freedom, the Denavit-Hartenberg notation is very compact. 
There arc only four parameters to describe a relation between two links. These parameters arc the link 
length , link twist, link offset, and link rotation. The parameters arc also known as the Denavit-Hartenberg 
par ameters. The Denavit-Hartenberg notation is a specialization of the Paired Joint Coordinates method 
for describing an articulated figure, see Section 2.2. 

Consider an articulated figure as the one shown in Figure 2.7. The Denavit-Hartenberg parameters of 
this articulated figure arc defined below and summarized in Table 2.1. The Denavit-Hartenberg parameters 
used to describe an articulated figure can be computed by the following steps: 

1. Compute the link vector a* and the link length a* = a*. 

2. Attach coordinate frames to the joint axes. 

3. Compute the link twist crj. 

4. Compute the link offset dj. 

5. Compute the joint angle <pi. 

6. Compute the transformation <l 1 ! T r , which transforms entities from linki to linki- 1 - 

The rest of this section contains subsections that describe how to perform each of the steps listed above. 




Figure 2.7: This figure shows all the symbols used for the Denavit-Hai'tenberg notation and the physical 
meaning of those symbols. It should be noticed that the straight lines marked with the symbols • arc 
parallel, and this is also true for the straight lines marked with the symbol o. This means that the line 
marked with • is parallel to the axis of joints and the line marked with the symbol o is parallel to the 
coordinate axis i. Use this figure as a reference. All the details arc explained below. 
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link length a t 
link twist on 


link offset d. 


joint angle <pi 


The perpendicular distance between the axes of joint i and joint i+1 . 

The angle between the axes of joint i and joint i+l . The angle oti is mea¬ 
sured around the ccj-axis. Positive angles are measured counterclockwise 
when looking from the tip of vector Xi toward its foot. 

The distance between the origins of the coordinate frames attached to joint 
jointi_i and joint; measured along the axis of joints For a prismatic joint 
this is a joint parameter. 

The angle between the link lengths o*_i and a*. The angle ipi is measured 
around the Zj-axis. Positive angles are measured counterclockwise when 
looking from the tip of vector Zj toward its foot. For a revolute joint this 
is a joint parameter. 


Table 2.1: Denavit-Flartenberg parameters. 


2.3.1 The [.ink Vector a, and the Link Length a* = ||aj|| 

Consider the axes of joint i and joint i+1 . The link length a, is the shortest distance between these axes. 
This can also be formulated as: the link length aq is the distance between the closest point p i on the axis 
of joint j and p l+ \ on the axis of joint i+l . Let the joint axes be given by the parametric vector equations 

li(s)=Pi + sUi (2.10a) 

h+i (t) = p i+ 1 + tu i+ i (2.10b) 

where p t is a point on axis of joints and u, is a direction vector of that axis. Analogous for p i+1 and 
Ui- |-i. This is shown in Figure 2.8. The following three sections describe three different methods for 
computing the link vector cq. 


2.3.1.1 Method 1: The Pseudonaive Approach 

The shortest distance, oq, between the axes of joint i and joint i+ , is given by the length of the vector 
which connects the axes and which is perpendicular to both of them. That distance can be computed as 
the dot product between the vector p i+1 — p i and a unit vector in the direction of u, x u 1+ \ 


a i = (Pi+1 - Pi) ■ 


Uj X U i+ 1 
Ui X w i+ i|| 2 


( 2 . 11 ) 


To find the locations of the points o, and o a where this distance exists, one can go some distance s from 
point Pi along joint axis i, and then the distance a, along the unit vector u., x u l+ \ / | u t x Uj+i|| , and 
finally some distance t along joint axis i + 1 to arrive at point p i+i . This results in the following equation 


Ui X Itj-(-I 

Pi + SUi + ttij -= p i+1 + tu i+ l =^> 

II Ui X "li j-f-l || 2 

Ui X Uj . 1 

SUj iu l+ 1 — Id 1 Pi Q>i Tj [p - 

||Ui X w i+ i|| 2 


(2.12a) 

(2.12b) 
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joint axes joint i+1 




\ / 


Figure 2.8: The shortest distance between the axes of joints and joint i+1 is given by the vector a*, which 
is perpendicular to both joint axes. The length of a* is denoted the link length. 


Computing the dot products of (2.12b) and the vectors u, and u l+ \, and recalling that the dot product 
between Ui x u,+\ and Ui and ttj+i vanish, the following equations in the unknowns s, t are obtained. 


S \\Ui\\l - t(Ui • u i+ 1 ) = (p i+1 - Pi) ■ Ui 
s(Ui ■ u i+ 1 ) - t \\u i+l \\l = (p i+1 - Pi) ■ Ui+1 

The solution s, t of this linear system of equations is given by 

(Pi+1 - Pi) ' ( u i llWi+llla - u i+l (Ui ■ 1H + i)) 

S = - 2 - 2 - 3 - 

\\Ui \\ 2 ||w.j+l || 2 - (Ui ■ u i+ 1 ) 2 

^ _ (p i+ 1 - Pi) ■ ( Ui(v,i ■ u i+ 1) - u i+ i WmWl) 

I* 2 2 ) 

l|tti|| 2 ||wi+i|| 2 - (ui ■ u i+ iy 


(2.13a) 

(2.13b) 


(2.14a) 

(2.14b) 
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Using (2.10) the points o, and o ai can be computed as 


. , , , (Pi+i - Pi) ■ (IK+ill? - w*+i(w< ' «i+i)) 

Oi = li(s) = Pi 4 -H- 7 T— =- - - Ui 


\Ui 


I'Ui+lll - (Ui ■ U i+ 1) 2 


°Oj — ^i+1 (/) — Pi+l + 


(Pi+1 - Pi) ■ ( Ui(Ui ■ u i+ 1 ) - U i+ 1 llttig) 


w,: 


||t*i+i||J - (w; • Hi+l) 2 


-Wi+l 


(2.15 a) 
(2.15b) 


2.3.1.2 Method 2: The Geometric Approach 

The vector, which is perpendicular to both joint axes is given by the cross product of the direction vectors 
ui x Ui + 1 , but it is not known where this vector is located on the joint axes. To find this location, one can 
go some distance s from point p, along the axis of joint and then go some distance k along Ui x u l+ \, 
and finally go some distance t along the axis of joint i+1 to arrive at point p i+i . This can be expressed in 
the following equation 

Pi + sui + kui x ttj+i = p i+1 + tu i+ 1 (2.16) 

This is a vector equation in three unknowns s, t, k. To solve this equation, first eliminate the unknown k 
by computing the dot product of (2.16) and vector ttj. 


Pi ■ Ui + sUi ■ Ui + k(ui x u i+ 1 ) • Ui = p i+1 ■ Ui + tu i+ 1 • Ui, (2.17a) 

Pi ■ Ui + sui ■ Ui = p i+1 ■ Ui + tu i+ 1 • Ui, (2.17b) 

S \\ui\\l - t(u,i ■ u i+ 1 ) = (p i+l - Pi) ■ Ui (2.17c) 

Second, eliminate the unknown k by computing the dot product of (2.16) and vector Ui + 

Pi ■ u i+ 1 + sui ■ u i+ 1 + k{ui x u i+ 1 ) • u i+ 1 = p i+l ■ u i+ 1 + tu i+ 1 • u i+ 1 , (2.18a) 

Pi ■ u i+ 1 + sui ■ u i+ 1 = p i+1 ■ u i+ 1 + tu i+ 1 • u i+ 1 , (2.18b) 

s{ui • u i+ 1 ) - t lltti+ill^ = (Pi +1 - Pi) ■ ttj +1 (2.18c) 


Finally, eliminate the unknowns s, t by computing the dot product of (2.16) and vector u 7 x u l+ \. 


Pi ■ (■Ui x u i+ 1) + SUi ■ (Ui x u i+ 1) + k(ui x u i+ i) ■ ( Ui x u i+ 1) 

= p i+1 ■ (■ Ui x u i+ 1 ) + tu i+ i ■ (■ Ui x u i+ 1 ), (2.19a) 


Pi ■ [Ui x u i+ 1 ) + k{ui x Wj+i) • (Wj x Mj+i) = p i+1 • x u i+ i), (2.19b) 

k ||Ui x Wj+il^ = (p i+ 1 - Pi) ■ (■Ui x u i+ i) (2.19c) 

The equations (2.17c), (2.18c) and (2.19a) form a system of three equations in the unknowns s, t, k. 

s \\ui\\ 2 2 - t(ui ■ u i+ i) = G Pi+l - Pi) ■ Ui (2.20a) 

s(ui ■ u i+ i) - t \\ui +1 \\ 2 2 = ( p i+1 - p.i) ■ u i+ i (2.20b) 

k \\ui x = (p i+ 1 - Pi) ■ (■Ui x u i+ 1 ) (2.20c) 


© 


© 


© 


© 



© 


“book” — 2005/9/30 — 15:44 — page 27 — #39 


£ 


© 


© 


2.3 PHYSICS-BASED ANIMATION 


27 


It can be seen that only (2.20a) and (2.20b) arc needed to solve for the unknowns s,t, and (2.20c) yields 
k directly. 

From (2.20c) it can be seen that the shortest distance between the axes of joint t and joint i+1 is given 
by the vector 


a, = 


(Ui X U i+ 1) 


(Pi+ 1 - Pi) • (Uj. X «i+l) 

||Ui X Wj+ill^ 

(Pi +1 - Pi) ■ ( Ui X U i+ i) Ui X u i+ i 


= a,- 


11 Mi X U i+ 1| 
Uj X tt i+ i 
| Mi X Mj+i|| 


|Mi X « i+ i| 


where the link length a t is given by 


a,- = 


(Pi+i -Pi)' ( u i x Mj+1) 

IMj X U i+ 1|| 


From (2.20a) and (2.20b) the unknowns s, t can be computed 


s = 


(Pi+l - Pi) ■ (Mi ||Mi+l||J - Mi+i(Mi • Mi+i)) 


Mi 


l|Mi+l 11? “ ( u i ' Mj+l) 2 


t = 


(Pi+l ~Pi)- ( Mi (Mi • U i+ 1) - M i+ i 11 Ui\\l) 


Ui 


l|Mi+l ||? - (Mi • Mi + l) 2 


Using (2.10) and the solution s, t the points o, and o ai can be computed as 


, , * , (Pi+i - Pi) ■ ( U i ||Mi + l|| - U i+1 (Ui ■ u i+ 1)) 

Oj = li{s) p, -- —-s—-—--- Ui 


Ui 


|M i+ i|| 2 - (Ui ■ U i+ 1) 2 


°ai ~ h+ 1(/) — Pi+l + 


(Pi+l - Pi) ■ ( Mi (Mj • M i+ i) - Ui +1 \\Ui\Q 


Ui 


||Mi+i|r - (Ui ■ u i+ 1 ) 2 


Mi+l 


(2.21a) 

(2.21b) 

(2.21c) 

(2.22a) 

(2.23a) 

(2.23b) 

(2.24a) 

(2.24b) 


The point a, is the origin of the coordinate system attached to the axis of joint + and the point o ai is the 
intersection between the axis of joint i+ i and the vector a* when it stalls at point o, . 


2.3.1.3 Method 3: The Analytic Approach 

The distance between two arbitrary points located on the axes of joint , and joint i+l respectively is given 
by the expression 

d(s,t) = y f (Zi+i(i) - h(s)) ■ (li+i(t) - Zi(s)) (2.25a) 

= \J(p i+ 1 + tUi +1 - Pi - sut) ■ ( p i+1 + tu-i +1 - Pi - su-i ) (2.25b) 
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The link length of linki, a t , is given as minimum distance between the joint axes. Therefore, a* is the 
solution to the minimization problem 


at = min d(s,t) = min (h+i(t) - h(s )) • (li+i(t) - h(s)) 

= min sj(p i+l + tu i+ i - p { - sui) ■ (p i+1 + tu i+ i - p { - sui) 
A necessary condition for (2.26a) to have an extrema is that it has the stationary points 

dd(s,t) _ (p i+ 1 + tu i+ i -Pj- suj) ■ Uj __ 

\j(Pi +1 + tu i+ 1 - Pi - sui) ■ {p i+1 + tu i+ 1 - Pi - SUi) 
dd(s,t) _ {p i+1 + tu i+l - p t -su.i) ■ u i+l _ n 

dt 


{Pi+1 + tU i+ 1 - Pi - SUi) ■ (p i+1 + tU i+ i - Pi - SUi) 


The requirement that (2.27) vanish is equivalent to their numerators being equal to zero 

{.Pi +1 + tu i+ 1 - Pi - su.i ) Ui = 0 
{Pi +1 + tu i+ i - Pi - sui ) • u i+ i = 0 
Rewriting this system of equations yields 

S \\UiWl - t(Ui ■ Ui+ 1) = (p i+1 - Pi) ■ Ui 
s(Ui ■ Ui+ 1) - t \\u i+l \\l = (p i+1 - Pi) ■ Ui+1 
The solution s, t of this linear system of equations is given by 

(Pi+i - Pi) ■ ( u i llwi+illa - U i+ i{Ui ■ u i+ i)) 

S = -2-2-3- 

\\ u i \\ 2 ll«i+ill 2 - ( u i ■ Ui+i ) 2 

_ (Pi+1 - Pi) • {Ui{Uj • Uj+ 1 ) - Uj+1 \\u t \\ 2 2 ) 

\\Ui\\l Wui+iWl - (Ui ■ Ui+ 1) 2 

Using (2.10) and the solution s, t the points o, and o 0i can be computed as 


Oi = li(s) = Pi + 


°ai ~ h+ l(t) — Pi+l + 


(Pi+1 - Pi) • (Ui \\Uj+l\\l - Ui+l(Uj • Uj+ 1 )) 

]\Ui\\l lltti+llla “ ( U i ‘ U i+ 1) 2 

{Pi+1 - Pi) • {UijUj -Uj+i) - Uj+1 \\Uj\fy_ ' 
1 \\Ui\\ 2 \\Ui+i\\ 2 - (lii ■ Ui+i) 2 


(2.26a) 

(2.26b) 

(2.27a) 

(2.27b) 

(2.28a) 

(2.28b) 

(2.29a) 

(2.29b) 

(2.30a) 

(2.30b) 

(2.31a) 

(2.31b) 


These two points arc the closest points on the axes of joint j and joint i+1 . Therefore, the link vector a; 
and the link length a* arc given by 


O : — (+i Oi 


® i — | 11 2 — || O a; Oj|| 2 


(2.32a) 

(2.32b) 
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The Link Vector a, 

The link vector a, is the vector that connects the two closest points on the axes of joint i and joint i+1 
respectively. Therefore, the link vector a, is the difference between the points o ai and o, from (2.31). 


a, = o n — o, 


, , (Pi+1 - Pi) ■ ( «i(«i ■ Ui+l) - «i+l \\Ui\\ 2 ) 

— Pi+1 + ,, 1,2 || 112 I \ 2 U i +1 

INIL IN+i|L - {ui ■ u i+l y 


, , (Pi+l-Pi) ■ {Ui\\u i+1 \\ o -U i+1 (Ui-U i+1 )) 

- P: ---—-—— --- u, 


\Ui\\ ll'Uj+lIL - (Ui ■ U i+1 ) 2 


_ N (P*+i “Pi) ■ (w*ll«i+i|| 2 -w*+i(«i-Wi+i)).. 
(Pi+1 Pi) •> •' - 11 ' 


\Ui\\ ||tti+l|| - (Ui ■ u i+ 1 ) 2 


+ 


(Pi+i - Pi) ■ (uyui ■ u i+ 1) - u i+ 1 \\mWl) 


Ui\\ 2 \\u i+1 \\l - ( Ui ■ u i+ 1 ) 2 


-Ui + 1 


(2.33a) 


(2.33b) 


(2.33c) 


In the next section it will be shown that the link vector a, is perpendicular to the axes of jointi and 
joint i+1 . 


The Dot Products a, • u, and a t ■ u ir \ 

In this section it is shown that the link vector a, is perpendicular to both axes of jointi an d joint i+l . The 
dot product between the link vector a,; and the direction vector ui of the axis of jointi i s equal to 


, , (Pi+t -Pi)' («i ||«i+t|| 2 - «*+i(«i • u i+ 1)) , , 

a i ■ u i = (Pi+1 - Pi) • Ui ---^---(«i ■ U i) 


|^i|L ||^+l|L 


(p i+ l - Pi) ■ ( Ui(Ui • U i+ 1) - 1*i +1 ||t*i|| ) 

“1“ ,, ,.o n „o , . « (^z+1 * 'U'i) 


= {p i+ 1 - Pi) ■ Ui - 


Ui W 2 \\u i+ i \\ 2 - (■ Ui ■ u i+ i ) 2 

(Pi +1 - Pi) ■ (Ui \\Uif \\u i+ i \\ 2 - U i+ 1 \\Ui\\ 2 (Ui ■ u i+ 1 )) 


(2.34a) 


+ 


\\ui\\: 2 ||lli+l \\ 2 - [Ui ■ U i+ 1 ) 2 
(Pi+ 1 - Pi) ■ ( Ui(Ui • U i+ 1) 2 - U i+ 1 11 Ui 11 2 ( Ui ■ U i+1 )) 


\ui\r ||ui+i||, - (ui ■ u i+ 1 ) 2 


= (Pi+1 - Pi) ■ Ui + (p i+1 - Pi 


'll, 


||tti||j ||tti+l||j + («i • Wi+i) 


|tti||; ||wi + i|r - ( ui ■ u i+ 1 ) 2 


= (p i+1 - Pi) • Ui - (p i+1 - Pi) • Ui = 0 


(2.34b) 

(2.34c) 

(2.34d) 
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Analogously, it can be shown that the dot product between the link vector a and the direction vector Wj+i 
of the axis of joint i+l is then equal to 0, i.e., a* • Wj+i = 0. 


a, 


u i+ 1 (Pi+ 1 Pi) ' u i+ 1 


(P<+1 “ P*) ‘ ( U i llWi+lllj “ u i+l( u * ' U i+l)) 


+ 


Milla ||«i+i||, - (Ui ■ U i+ 1) 2 

(Pi+1 “ Pi) ■ («i(«i ' u i+ 1) - u i+ 1 IK© , * 

-(M i+ 1 • U i+ 1) 


K ■ U i+ 1) 


= (Pi+1 - Pi) ' w i+l - 


Ui W 2 \\u i+ i\\ 2 - (■ Ui ■ u i+ 1 ) 2 

(Pi +1 - Pi) ■ ( Ui \\u i+ i\\ 2 ( Ui • U i+ 1 ) - U i+ i(Ui ■ U i+ 1 ) 2 ) 


(2.35 a) 


Ui 


I Ui-\- \ 11 (lij • 


+ 


(Pi+1 “Pi) • ( u i ll w i+lllo («< ' w i+l) - u i+l ll«i|la ll^i+lllo) 


= (p i+ i - Pi) ■ U i+ 1 + (p i+1 - Pi) ■ U i+ i 


Ui\\ 2 llWj+lllj - (Ui ■ u i+ 1) 2 

- ||tti+l||^ + (Ui ■ u i+ 1 ) 2 


Ui\\ \\u i+1 r - (Ui ■ u i+ 1 ) 2 


= (Pi +1 - Pi) ■ u i+ 1 - (p i+ i - Pi) • Wi +1 = o 


(2.35b) 

(2.35c) 

(2.35d) 


Thus, the link vector a* is perpendicular to both axes of joints joint , and joint i+1 . This means that there 
exists a plane tti with normal vector a, which contains the axis of joint, . Also, there exists a plane 7Ti + i 
with the same normal vector aj which contains the axis of joint i+l , see Figure 2.9. 


2.3.1.4 The Easy Way to Compute a t and | a, 


The link vector a* is perpendicular to both of the axes of joint, and joint i+1 as shown in (2.34) and (2.35) 
and in Figure 2.9. The unit vector c,; given by the cross product between the direction vectors u, and of 
the |_i of the joint axes 


Ui x u i+ 1 

Cj — II || 

IK X u i+ 1 || 2 


(2.36) 


is parallel to the link vector a,. 

Given two points p t and p, on the axes of jointi and joint i respectively, the link length can be 
computed as 


a.; 


= l(pi+l -Pi 


(Pi+l - Pi) ■ 


Uj X U i+ 1 
Ui X Wi+11| 2 


(2.37) 


and the link vector a* can be computed as 


Ui = ((p i+ 1 - Pi) ■ Ci)Ci 


((Pi+! - Pi) ■ 


Ui X Wi+i 
Ui X Wj+i|| 2 


Ui X Wi+i 
Ui X Wi+11| 2 


(2.38) 


These computations arc illustrated in Figure 2.10. It can be shown by simple but very many algebraic 
manipulations that the link vectors computed by (2.33) and (2.38) arc identical. 
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Figure 2.9: The axes of joint. L and joint i+1 arc contained in two parallel planes 7r; and 7r; + i respectively. 
The perpendicular distance between the planes tt z and tt, + i is equal to ||oj|| . 

The vector c, = u, x 'u,_ i / |'u z x u t+ \ | _ can either point from the axis of joint i to the axis of 
joint i+l or in the opposite direction depending on the orientations of the vectors u , and u., + \. If the 
vector c ? ; points from the axis of joint i+l to the axis of joints the dot product ip t+ \ — p,) • Ci is negative, 
and the link vector a z will get the right orientation. 

In the other case where the vector c t points from the axis of joint i to the axis of joint i+l , the one 
shown in Figure 2.10, the orientation will obviously be right. 

2.3.1.5 Special Case: The Joint Axes Intersect 

If the axes of jointi and joint i+i intersect then the shortest distance between the axes, a* is equal to zero, 
and the link vector is the null vector, which has any direction. In this case, choose the link length, a* = 0 
and the link vector to be equal to the cross product between the direction vectors, u., and Ui + \ between 
the axes of joint i and joint i+1 . That is, choose 

a, = 

Oi — 


Uj X u i+1 
| Ui X Ui + i|| 


0 


(2.39a) 

(2.39b) 
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Figure 2.10: An easy way of computing the link length ctj. Let the vector c be equal to the unit cross 

product between the direction vectors of the axes, i.e., the vector c, is given as c t = M * x ' l ' t,+1 ll , and let 

\\Ui xu i+ i || 2 

the points p i and p i+1 be arbitrary points on the axes, e.g., the points given in (2.10). Then the link length 
is equal to the dot product between the vector c t and the vector (p l+ \ — p,j. That is a* = (p %+ \ — Pi ) • c,. 


For this special case it is convenient to choose such a strange link vector that does not conform to (2.38), 
because the link length a* is a Denavit-Hartenberg parameter, and the link vector a, is used to attach the 
coordinate frame, the aq-axis of linki. Furthermore, the origin o, and the point o 0i arc uniquely defined 
as the intersection point. 

2.3.1.6 Special Case: The Joint Axes are Parallel 

If the axes of joints and joint i+i arc parallel, there is no unique shortest distance between the axes, see 
Figure 2.11. In this case, compute the link vector and the link length as follows 

a-i = (p i+1 -Pi) - ((p i+ 1 -Pi) - (2.40a) 

V W u i \\ 2 / ll"*ll 2 

a i = \\a,i\\ 2 (2.40b) 

where p, and p l+l arc known points, and u, and u l+ \ arc known direction vectors of the axes of joint i 
and joint i+1 , see (2.10). The origin o, can be chosen arbitrarily, and the point o ai is obviously given by 
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Figure 2.11: The joint axes arc parallel, so there is no unique shortest distance between them. 


o, and a,. Since the origin o, can be chosen arbitrarily, it might be clever to choose it such that most of 
the Denavit-Hartenberg parameters will be equal to zero. 

2.3.1.7 Special Case: The First Joint 

An articulated figure must start at some place, and therefore there is a problem with the very first joint, 
because there is no link preceding it. Therefore, a base link is introduced, denoted link o- The link frame 
for linko can be chosen arbitrarily, but it is clever to let it coincide with the link frame of link i when the 
articulated figure is in its rest position. Then most of the Denavit-Hartenberg parameters will be equal to 
zero. 


2.3.1.8 Special Case: The Last Joint 

An articulated figure must stop at some place, and therefore there is a problem with the very last joint, 
because there is no link following it. Generally, the coordinate frame of the last link can be chosen 
arbitrarily, because there is no physical link, i.e., the articulated figure stops at the axis of joint N , so there 
is no link vector a n- The only thing that is given is the axis of joint N which becomes the z y-axis. But it 
is clever to choose the origin oy, the link vector a ; y (the a;/y-axis), the link offset Ay, and the link twist 
a y so that most of the Denavit-Hartenberg parameters arc equal to zero. 

2.3.2 Coordinate-Frame Attachment 

Given an articulated figure, the first thing to do is to attach one coordinate system to each link. These 
coordinate systems are called link frames. The procedure for attaching the link frames is as follows: 
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joint i 

ini 

P 


joint i+1 



Figure 2.12: The joint axes of an articulated figure. The axes arc given by the parametric equations 
li(s) = Pi + su t . Think of the joint axes as the vectors Uj. 

1. Identify the joint axes. 

2. Identify the common perpendiculars of successive joint axes. 

3. Attach coordinate frames to each joint axis. 

In the following will each of the above steps be performed. 

First, identify the joint axes of the articulated figure (see Figure 2.12). The axis of joint i is given by 
(2.10a), and repeated here for convenience. 


k(s) = Pi + sui (2.41) 

Next, identify the common perpendicular between neighboring joint axes. That is, identify the com¬ 
mon perpendicular a* between the axes of joint i and joint i+1 . Also, identify the point o, where the 
common perpendicular intersects the axis of joint This is illustrated in Figure 2.13. 

In Section 2.3.1.3 we showed how to compute the shortest distance a* between the axes of joint j and 
joint i+l . We also showed that the shortest distance is along the common perpendicular between the axes 
of jointi and joint i+1 . Finally, we showed how to compute the location of the shortest distance o, on the 
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joint t 


joint i _ 1 


joint i+l 



axis of joints What is needed here is the link vector a f and its intersection o, with the axis of joints see 
(2.31) and (2.32a) in Section 2.3.1.3. 

Finally, the i th link frame, shown in Figure 2.14, can be constructed in as follows: 

1. The Origin: Let the origin of the i th link frame be at the point o, on the axis of joints 

2. The Zj-axis: Let the Zj-axis be along the i th joint axis. That is, let the z t be parallel to vector Ui 
from (2.41) 



(2.42) 


3. The ./•-axis: Let the ax is be along the link vector a* from (2.32a) 


Xi = 



(2.43) 


4. 


The y ( -axis: Let the .'/:,-ax is be such that the vectors x t , y t . z,. form a right-handed orthogonal 
coordinate system. That is, let y , be given as 


Vi = 


Zj x Xj 
Zi X Xi\ 


(2.44) 
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jointi 


joint i-i 


joint i+1 



2.3.3 The Link Twist a* 

The link twist is the angle a* between axes of joint,; and joint i+1 . The angle aj is measured around the 
tCj-axis. Positive angles are measured counterclockwise when looking from the tip of vector x, toward its 
foot, see Figure 2.15. More specifically, the link twist a* is the angle between the direction vectors of the 
joint axes Uj and u t+ \ measured around the link vector a*. 

Recall some properties of the scalar and vector products. 

Ui ■ u i+ 1 = ||uj|| 2 ||tti+i|| 2 cos oil o < < 7T (2.45a) 

||Ui x u i+ 1 || 2 = ||wj|| 2 ||tti + i|| 2 sin oti 0 < a t < 7r (2.45b) 

which is equivalent to 

Uj ■ UjM 1 

cos aii = -—-— -— 0 < a* < ir (2.46a) 

|| tTj || 2 || 2 

11 It; X Uj. 1.1 I 

sin oti = -—-— - rf- 0 < a.j < tt (2.46b) 

| Uj || 2 ||ltj-)-i || 2 

B—, , ,—© 


© 


© 
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Figure 2.15: The link frames and link twist a*. In the figure, the link twist a* is negative. 


Notice, since 0 < sin a* < 1 while —1 < cos a t < 1, the angle cti will always be in the interval 
0 < a* < 7T, see Figure 2.16. In the following we show how to compute the link twist a:, correctly. 

Recall that the mathematical function arctan : M — > M returns a value in the interval [—7r, it}. Let 
the function arctan2 : M 2 —> M be defined as 


arctan2 (n, d) 


< 


arctan 

arctan 

arctan 

arctan 



+ TT 

— TT 


if n > 0 A d > 0 
if n<0Ad>0 
if n>0Ad<0 
if n<0Ad<0 


That is, if the link twist a, is naively computed as 


/ ||Ui X U i+l \\ 2 Uj ■ U i+ 1 \ 

Vll w ill 2 ll«i+i|| 2 ’ I|m*II 2 ll«i+i|| 2 / 


(2.47) 


a* = arctan2 


(2.48) 
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Figure 2.16: The twist angle cc* is always positive and in the interval 0 < a-, < tt. Since 0 < sin a t < 1 
and — 1 < cos a, < 1 there are two possible angles a^ ] and o^ 2> depending on the sign of cos ctj. 


the result will sometimes be wrong, because sin a, from (2.46b) is always positive. That means, the angle 
a* will always be one of the two positive angles a[ l > or (x\ ] shown in Figure 2.16, i.e., a:, will always be 
positive. 

The angle a, in the cross product, (2.46b), is measured around the axis u r x u, ( |. If the vector 
Ui x Ui + 1 has the same direction as the link vector a, then the angle cc* in (2.48) is correct, see Figure 2.17 
If the vector itj x u l+ \ has the opposite direction as the link vector a* then the angle a.; in (2.48) is not 
correct. The correct angle a* is the negative of the angle in (2.48), see Figure 2.18. 

To distinguish between the two cases where the vectors Ui x u l+ \ and a* have the same or opposite 
directions, it suffices to check the sign of their dot product («j x u,+\) • a*. That can be expressed in the 
following equation 


+ arctan2 


OLi = < 


\Ui x « i+ i|| 2 

w *ll 2 ll u i+ill 2 

|Mi X «i+l|| 2 

Will, l|wi+i || 2 


Mj ' Ui -)_l \ 

m ;|| 2 ||Mi+i|| 2 y 

Mj ' Mj_)_l \ 

Mi ||„ ||Mi+i|| J 


if (Ui x u i+ 1 ) • a t > 0 
if ( Ui x u i+ 1 ) ■ a,i< 0 


— arctan2 


(2.49) 
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Ui X U i+ 1 



Figure 2.17: The twist angle is positive because the direction of the vector Ui x u, + \ has the same direction 
as the link vector a*. 


m X Ui +1 



Figure 2.18: The twist angle a. t is negative because the direction of the vector u, x u 1+ \ has the opposite 
direction of the link vector a L . 
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Figure 2.19: The link offset is the distance between the origins of the coordinate frames attached to joint 
joint i _ 1 and joint j measured along the axis of joints 


2.3.4 The Link Offset d{ 


The link offset is the distance between the origins of the coordinate frames attached to joint joint i _ 1 and 
jointi measured along the axis of joints More specific, it is the distance between the point o a/1 where 
the | -axis intersects the axis of joint t ■ and the origin o, of the link frame of linki, see Figure 2.19. The 
points o, and o ai arc given by (2.31) for any i. That is 


di 



Oi-O 

Oi-O 


a i -1 ll 2 
“i-l il 2 


if (oi - o ai _J • Ui > 0 
if (oi - o^.j) • Ui < 0 


(2.50) 


2.3.5 The Joint Angle <pi 

The joint angle ipi is the angle between the link vectors a,_i and a*. The angle tpi is measured around 
the Zi~ axis. Positive angles arc measured counterclockwise when looking from the tip of vector z* toward 
its foot, see Figure 2.20. The joint angle ipi is computed the same way as the link twist o- t , which was 
described in Section 2.3.3. The difference between the computations arc the vectors which arc involved. 
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Figure 2.20: The coordinate frames of the links and the joint angle. In the figure, the joint angle tpi is 
negative. 


The joint angle <pi is the angle between the link vectors | and a* measured around the axis z % 
which is parallel to the joint axis u t . Using the same arguments as in Section 2.3.3 the joint angle is 
computed as follows 


<Pi = < 


+ arctan2 ( - 


1^2—1 X Q'i 


V \\ a i-l || 2 ll a *ll 2 


O'i— 1 * 

|®2—1 || 2 II 


— arctan2 


/ ||Oj-i X Cli || 2 —CLi_^CLj—\ 

l|L II®* II o II®*—l|| 2 ||®*|L/ 


if 

if 


(®i-i X Cii) ■ Zi > 0 
(®i-l X CLi) ■ Zi < 0 


(2.51) 


2.3.6 The Link Transformation l , T, 

The transformation l) T, transforms entities from one frame to the preceding frame. This can be ex¬ 
plained as follows. Given the coordinates of a point p, specified in linkframe t , the transformation ! i ~ 1 ! T, 
transforms point p, such that its coordinates arc specified in linkframe i _ 1 . 

The idea is to transform the point p, with coordinates in linkframe i with a transformation that 
makes linkframe i _ 1 coincide with linkframe j. The transformation of the coordinates of point p, from 
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Figure 2.21: The coordinate frames of the links and the joint angle. 

linkframe.i to linkframe i _ 1 is done in several steps: 

1. Rotate the joint angle <pi around the axis z t . 

2. Translate the link offset d, along the axis z t . 

3. Translate the link length ctj_i along the axis x j. 

4. Rotate the link twist angle a*_i around the axis x t . 

That is. the transformation il ~ l ) T, has the form 

di, Oj_i, «i_i) = R Xi (ai-i)T Xi (ai-i)T Zi (di)R Zi (ipi) (2.52) 

A detailed description of the transformations R Xi (ai-\), T Xi (a,i-i), T Zi (di), R Zi (tpi) is given in the 
following sections. 

2.3.6.1 The Transformation R Zi (ipi) 

Consider Figure 2.21. The angle between the x\ and i is equal to ipi measured around the axis z*. So, 
by rotating the angle <pi around the axis z z we make the coordinate axes x t and Xi-i parallel. The actual 
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transformation is given as 




COS ipi 
sin ipi 

0 

0 


— sm (fi 
cos ipi 

0 

0 


After this transformation, the axes x^ and Xi-\ are parallel, i.e., Xi || |. 


(2.53) 


2.3.6.2 The Transformation T Zi (di) 


Still consider Figure 2.21, but remember that the axis x, has been rotated such that it is parallel to the axis 
Xi- 1 - By translating the distance d, along the Zj-axis the axes Xi and Xj-\ will not only be parallel, they 
will lie on the same line. The actual transformation is given as 


T Zi (di) 


10 0 0 

0 10 0 

0 0 1 di 

0 0 0 1 


(2.54) 


After this transformation, the axes Xi and x,- \ are not only parallel, but they are located on the same line 
i.e., Xi = Xxi-i. 


2.3.6.3 The Transformation T Xi (a,-\) 


Still consider Figure 2.21, but remember that after the previous transformations the axes Xi and \ are 
now on the same line. By translating the distance a,;_i along the .x,-axis, the origins o 7 ; and o 7 _ i will 
coincide. The actual transformation is given as 


T Xi (oi -1 ) — 


10 0 i 

0 10 0 

0 0 1 0 

0 0 0 1 


(2.55) 


After this transformation, the axes x t and x t - \ are on the same line, and the origins o ? ; and o,_ i coincide, 
i.e., Oi — Oi— \ 


2.3.6.4 The Transformation R Xi (oti) 

Still consider Figure 2.21, but remember that the axes x. t and x % - \ are now on the same line and that 
the origins o 7 ; and o,_ i are coincident. The only thing that needs to be taken care of is to transform the 
z 7 _ | -axis into the z 7 -axis. This is done by rotating the angle aj_ 7 around the axis x r = X\-\. The actual 
transformation is given as 

0 0 

cosa 7 _i — sina 7 _i 
sin a t -1 cosa 7 _i 
0 0 


Hxii&i— l) — 


(2.56) 


^3 
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After these transformations the two coordinate systems coincide and the final transformation (l l) T, from 

linkframe^ to linkframe i _ 1 is established. 

2.3.6.5 The Transformation (p t ) 

As shown in the previous sections, the transformation ■ is given as 

<pi, «i_i) = R Xi {ai-i)T Xi (ai-i)T Zi (di)R Zi (ipi) (2.57) 

Usually, the link parameters and a*_i arc constant for a given articulated figure, so the transformation 

('- 11 T, is really only a function of the two parameters d t and ipi. Multiplying the matrices yields the 
following expression for the transformation 1 ] T, 

(i_ 1 ) Ti(di,<Pi) = 

In some literature, it is often seen that the transformation ( t_1 ) T / is written in a different way. Omitting 
the parameters, the transformation has the form 

(i_1) Ti = R Xi T Xi T Zi R Zi (2.59) 

from which it can be seen that there arc only translation and rotation matrices. Furthermore, it can be seen 
that they can be written in two groups 

= (. R Xi T Xi )(T Zi R Zi ) (2.60) 

By inspection, it can be seen that this special matrix product commutes 

R Xi T Xi = T Xi R Xi (2.61) 

and that is because the matrix T Xi does only change the x-coordinatc, and the matrix R Xi does not change 
the x-coordinatc. 

By specifying the transformation ' l T,(di- fi) for all links, i = 1,..., N a transformation from link 
N to the base, link a can be written 

°T n = °TV(g?i, (pi ,..., d/v, <Fiv) = °T’i(di, ^i) 1 T 2 (d 2 , ^ 2 ) ■ ■ ■ N l TN{dN, <Pn) (2.62) 

Let 9i denote the generalized joint parameters for joint i, i.e., 9 t = ( d t . tp t ). Then (2.8) may be written 

°T n = °T N (9 U ...,9 n ) = o T 1 (0 1 ) 1 T 2 (0 2 ) • • • ( n ~Vt n ( 9 n ) (2.63) 

When using the Denavi t-Hartenberg description of an articulated figure, usually each joint is restricted 
to having only one degree of freedom. That is, each joint has only a translatory parameter di along its 
z-axis if it is a prismatic joint, or it has one rotational parameter tp % around its 2 -axis if it is a revolute 
joint. If a joint needs more degrees of freedom, several joints with link lengths equal to zero arc located 
on top of each other. 

This means that the generalized parameter vector 9, = (ipi,di) T becomes an ordinary parameter, i.e., 
the generalized parameter 9, becomes d, for a prismatic joint, and it becomes 93 , for a revolute joint. 


cos ipi — sinc^j 0 a*_i 

cosa^-i sin<£>j cos crj_i cos — sina^-i — d^sma^-i 
sin ai -1 sin ipi sin a j_ 1 cos 999 cos 1 d,; cos a 1 

0 0 0 1 


(2.58) 
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Kinematics is the study of the motion of rigid bodies without consideration of Newtonian laws. This and 
the following chapter describe methods of how to manipulate an articulated figure. Recall from Chapter 2 
the definition of an articulated figure, its notation, and how to navigate the structure. 

Forward kinematics , Section 3.2, is the most basic way to animate an articulated figure. All the joint 
parameters arc set manually and the result is the placement of all the links and joints in the whole figure. 
This method is not very practical if one wants to have the end-point of the articulated figure touching some 
specific point; for example, if a character in a computer game is to grab the door handle when opening the 
door. It is almost impossible to calculate the parameters of each joint in order to get the ami and hand in 
the position where they align with the door handle. 

An easier way is to use inverse kinematics , Section 3.3. This method, as the name suggests, starts with 
the end-point of an articulated figure and uses that to calculate each joint parameter. So an animator only 
has to worry about this end-point. In forward kinematics, one specifies each joint variable and the result 
is the end-point. In inverse kinematics the end-point is specified and the result is the joint variable. The 
end-point is most often called the end effector. This chapter starts by giving a description of the properties 
of this end effector. 

3.1 End Effector 

The end effector is defined to be the last coordinate system of an articulated figure. Notice that a compli¬ 
cated structure like a hand will have end effectors at the tip of each finger. 

In the notation in Chapter 2, the end effector is located in joint N . Usually, an animator is interested 
in the position and orientation of the end effector specified in base coordinates for easy use. 

Sections 2.2 and 2.3 derived two expressions (2.9) and (2.63) for the transformation °Tjv, which 
transforms the N th link frame to the base coordinate frame. The transformation is given by a 4 x 4 matrix, 
which is parameterized by the generalized joint parameters 0 , . This transformation is repeated here for 
convenience. 

°T n = °T n (0 u ...,0 n ) = o T 1 (0 1 ) 1 T 2 (02) • • • (Ar-1) lV(0jv) (3.1a) 

mu (0i, ■ ■ • ,0n) mi 2 (0 i,...,0Af) mi 3 (0i,... ,0 N ) mu(0i,... ,0 N ) 

m 2 i(0i,... ,0n) m 22 (0i,...,0Ar) m 23 (0i,... ,0jv) m 24 (0i,..., 0 N ) n 

m 3 i(0i,... ,0 N ) m 32 (0i,...,0Ar) m 33 (0i,... ,0 N ) m 34 (0i,..., 0 N ) ' 

0 0 0 1 

The location and orientation of the N th link frame might also be expressed relative to the base coordinate 
frame as three rotations, one around each of the coordinate axes followed by a translation. This results in 
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First: 

Yaw 

4- 

Rotate the angle 4 ; around the a>axis 

Second: Pitch 

0 

Rotate the angle 0 around the vy-axis 

Third: 

Roll 

$ 

Rotate the angle 4> around the z-axis 


Table 3.1: Order, name, and definition of the rotations. 


an alternative expression for the transformation °T N 

°T n = °T N (p , 0, ) = T(p)R z ($)R y (Q)R x ('£) (3.2) 

where T(p) is a translation matrix, and R z ( ( l>). R y (G). R x (4/) arc rotation matrices. This expression is 
a 4 x 4 matrix, where the elements arc functions of the translation vector p and the Euler angles 4>, 0, 
and 4 t 

The following sections will compute the transformation T(p)R z (&)R y (Q)R x ('S>) and establish the 
relations to the transformation °Tn(6i, • • •, On) given by (3.1). Finally, the notion of a state vector 
is defined. The state vector relates the position p and orientation 4>, 0, 4/ of the end effector to the 
generalized joint parameters 6\,..., riy 


3.1.1 Location of the End Effector T(p) 


The origin of the coordinate frame relative to some base coordinate frame can be specified by a translation 
by a vector p = \p x p y p 2 ] T . Using homogeneous coordinates, the coiTesponding transformation 
matrix Tip) is given by 


Tip) 


1 0 0 Pa; 

0 10 Py 

0 0 1 p z 

0 0 0 1 


(3.3) 


3.1.2 Orientation of the End Effector—Roll, Pitch, and Yaw 

Any 3D orientation relative to some base coordinate frame can be specified by three rotations, one around 
each of the coordinate axes. These rotations are named roll , pitch, and yaw respectively, and are shown in 
Figure 3.1. 

The order in which these rotations arc performed must be fixed. In the following, the order is: first 
rotate around the a>axis, second rotate around the p-axis, and finally rotate around the z-axis. The order 
and names of these rotations are summarized in Table 3.1, and described in the following sections. 


3.1.2.1 The Yaw Transformation Y(4/) = R x ( T) 


The Yaw transformation is a rotation by the angle 4' around the rc-axis, and it can be expressed as 


Y(tf) = R x W 


10 0 0 

0 cos 'F — sin 4/ 0 
0 sin 4< cos 4r 0 

0 0 0 1 


(3-4) 


■e 
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z 



Figure 3.1: Any orientation can be specified by three rotations, the Roll , Pitch , and Yaw around the x. y. 
and z axes respectively. The order of rotations are: rotate around the axis x. y.z. 


3.1.2.2 The Pitch Transformation P(0) = R y (Q) 

The Pitch transformation is a rotation by the angle 0 around the r/-axis, and it can be expressed as 


P(0) = Ry(e) = 


cos 0 0 sin 0 0 

0 10 0 

— sin 0 0 cos 0 0 

0 0 0 1 


(3.5) 


3.1.2.3 The Roll Transformation R(&) = R z { ( F) 

The Roll transformation is a rotation by the angle 4> around the z-axis, and it can be expressed as 


R($) = Rz($) = 


cos 4> — sin $ 0 0 

sin cos $ 0 0 

0 0 10 

0 0 0 1 


(3.6) 
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3.1.2.4 The Roll, Pitch, and Yaw Transformation Trpy($, 0, 4') 

The Roll, Pitch, and Yaw transformation denoted T rpy is the composition of the three rotations described 
above. The order of the composition is as follows 




R(<S>)P(Q)Y(V) = R z (<S>)R y (Q)R x (y) 


c4> 

—s4> 

0 

o' 


' C0 

0 

S0 

o' 


'1 

0 

0 

o' 

s4> 

c4> 

0 

0 


0 

1 

0 

0 


0 

c4> 

-s4^ 

0 

0 

0 

1 

0 


—S0 

0 

C0 

0 


0 

s4^ 

c4r 

0 

0 

0 

0 

1 


0 

0 

0 

1 


0 

0 

0 

1 


c4>c0 c4 > s0s4' — s4>c4' 
s4>c0 s4>s©s4' + c®C$/ 
—s0 c0s4' 

0 0 


c4>s0c4' + stbs’k 0 
s'hsOc'k - c4>s4r 0 
cOc'T 0 

0 1 


(3.7a) 


(3.7b) 


(3.7c) 


where s4> is shorthand for sin <I>, and c-4> for cos 4>, etc. 

The order of the rotations is essential: reading from right to left, first rotate around the a>axis, second 
rotate around the yy-axis, and finally rotate around the z-axis. 


3.1.3 The Transformation of the End Effector °T N (p , <I>. 0. T ) 


As outlined in the previous sections the 3D location and orientation of the end effector in some coordinate 
frame can be expressed as three rotations and a translation. Specifically, the coordinate frame of the end 
effector can be expressed in the base coordinate frame of the articulated figure as 


°T n = °T N (p , 0, *) = T(p)T RPY ( 0, 40 


1 

0 

0 

Px 


c4>c© 

c4>s0s4' - s4>c4' 

c'hsOc'P + s4>s4' 

o' 

0 

1 

0 

Py 


s4>c0 

s4>s0s4' + c<Pc4r 

sdJsOc'P - c4>s4r 

0 

0 

0 

1 

Pz 


—S0 

c0s4' 

c0c4' 

0 

0 

0 

0 

1 


0 

0 

0 

1 


c4>c0 c4>s0s4' — s4>c4' c4>s0c4' + s4>s4' p x 
s4>c0 s4>s0s4' + c4>c4' s<hs0c4' — cthsdr p y 
—s0 c0s4' c0c4' p z 

0 0 0 1 

where s4> is shorthand for sin <I\ and c4> for cos 4>, etc. 


(3.8a) 


(3.8b) 


(3.8c) 


3.1.4 Computation of the Parameters 

This section describes the relation between the general transformations °Tjv given by (2.9) or (2.58) in 
Sections 2.2.4 and 2.58 respectively, and the transformation (3.8c) from the previous section. 

More specifically, it describes how to compute the parameters, p, 4>, 0, 4', of the Roll, Pitch, and Yaw 
model described in Section 3.1.3, given an arbitrary transformation matrix (l T y which is a composition of 
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translations and rotations. Such a transformation matrix could be obtained as described in Sections 2.2.4 
and 2.3.6.5. 

A transformation °T n , which transforms points in the end effector’s coordinate frame to the base 
coordinate frame of the articulated figure (see (2.9) or (2.58)), can be represented as a 4 x 4 matrix, 
where the elements of the matrix m tJ arc functions of the joint parameters 0 1 ,..., 9n- Omitting the joint 
parameters 0-, for readability, the matrix has the form 



m it 

m 12 

m i3 

m i4 

o 

II 

m 2 i 

m-22 

W-23 

m 24 

m 3 i 

W-32 

m 33 

m 3 4 


0 

0 

0 

1 

this matrix is equal to [0 

0 

0 1] 


(3.9) 


homogeneous coordinates and that translations and rotations are isomorphisms. Therefore, there arc no 
perspective effects of this transformation. 

The matrix (3.9) and the matrix resulting from the Roll, Pitch, and Yaw model (3.8c) do the same 
thing. They transform entities specified in the coordinate system of the end effector to the coordinate 
system of the base of the articulated figure. Therefore, these matrices must be identical 


m u 

m V2 

m 13 



c<Fc0 

c<3?s0s47 - s<Fc47 

c4>s0c47 + sdxsT' 

Px 

m 2 1 

m 22 

m 23 

77124 


s4>c0 

s<Fs0s'F + c'bcT' 

s4>s0c47 - 

Py 

m 3 \ 

m 32 

m 3 3 

77734 


-sQ 

c0s4/ 

c0c47 

Pz 

0 

0 

0 

1 


0 

0 

0 

1 


By observation several relations can be found. The following sections show how to express the parameters 
p , <I>. 0, and T as functions of the matrix elements {m t? | i, j = 1,..., 4}. 


3.1.4.1 The Translation p 

From (3.10) it can be seen directly that the translation vector p = [p x p y p z ] 7 is given by 



Px 


77714 

p = 

Py 

= 

77724 


Pz 


777 34 


(3.11) 


3.1.4.2 The Yaw Angle T' 

The angle T can be computed as follows by inspection of (3.10) 


'T = arctan 


/ m 32 \ 

\m 33 J' 


This is possible because the following relations hold 

T = arctan ( 777-32 


\m 33 


f cos 0 sin’T 

= arctan 


V cos 0 cos T 


( sin dr 

= arctan - 

\cos 'T 


(3.12) 


(3.13) 
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The function tan has a period of n which might cause problems. In order to get the angle right, it is better 
to use the function arctan2 given by (2.47). The result then becomes 


T = arctan 2 (77732,77733). 


(3.14) 


3.1.4.3 The Pitch Angle 0 

At this point, the angle 47 is known, and it can by used for further computations. By inspection of (3.10), 
and the knowledge of the value of the angle T, the angle 0 can be computed as follows: 

0 = arctan (---) . 

\ 777.32 Sill W + 77733 COS 47 J 

This can be seen because the following relations hold 

0 = arctan ( -- -) = arctan ( - sm©- \ 

\ 77732 Sill 47 + 77733 COS W J \ COS 0 sin 47 + cos 0 cos 2 4/ J 

( sin 0 \ (sin 0 \ 

= arctan -- —3 --- = arctan -— . 

\cos 0 (sim 47 + cos 2 4i) / \cos 0 y 

The function tan has a period of 7 r, so in order to get the angle right it is better to use the function arctan2 
given by (2.47). The result is 


(3.16a) 

(3.16b) 


0 = arctan2(— 777 . 31,77732 sin 47 + 77733 cos 47) 


(3.17) 


3.1.4.4 The Roll Angle 4> 

The angle 47 can be computed as follows by inspection of (3.10) 


4> = arctan 



This can be done because the following relations hold 


4> = arctan 



= arctan 


sin 4> cos 0 
cos 4> cos 0 


= arctan 


( sin4> 


Vcos 4 ? 


(3.18) 


(3.19) 


The function tan has a period of it. This might cause problems in some cases, so in order to get the angle 
right, it’s better to use the function arctan2 given by (2.47) 


4> = arctan2(?77.2i, ?77ii) 


(3.20) 
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3.1.5 The State Vector 

Let the state vector s be defined as follows: 


Xposition 


Px 

Yposition 


Py 

Zposition 


Pz 

Yaw 


T 

Pitch 


0 

Roll 


<f> 


where the vector p = [p x p y p 2 ] 7 is given by (3.11), and the angles 4', 0, and <I> arc given by (3.12), 
(3.15), and (3.18). 

As shown previously, the elements of the state vector s arc computed from the elements of the trans¬ 
formation matrix °TV given by (3.9), and the transformation matrix is a function of the joint parameters 

6i,... ,0 N 


,.,9 N ) = °T l (9 l 

) . . . (* -1 )T.;(0 ? ;) • • - 

■( N -VT N (9 N ) = 



m u (9 i, • 

• • ,9n) 

mi 2 (0i,...,0jv) 

mi 3 (0i,..., 0 at) 

77714 (01 , • • 

•, 0/v 

771.21 (01, • 

■ ■ ,9n ) 

77722 (01, •••, 01V) 

77723 (01, ■■■,9 N ) 

77724(01, • • 

•, 9 n 

171-31 (01, . 

• • ,9n ) 

77732 (01,..., 0m) 

77733(01,..., 0at) 

77734 (01, • • 

■, 9 n 

0 

0 

0 

1 



(3.22) 


Because all the elements of the state vector s arc computed from the elements of this matrix, the state 
vector s is also a function of the joint parameters 0,. 9jy, which yields 


s(di,... ,9 n ) 


Px{9 1, • • • ,0 N ) 
Py(0 l,...,0jv) 
Pz{9 i, ■ ■ ■ ,6n) 

^(9 1,..., 9 n ) 
0(0i, ..., 9 n ) 
4>(0i, ..., 9 n ) 


(3.23) 


3.2 Forward Kinematics 

Forward kinematics can be explained as follows: given all the link and joint parameters of an articulated 
body, determine the 3D position and orientation of the link farthest away from the base, the end effector. 

The assumption is that the link and joint parameters arc known. Therefore, the generalized joint pa¬ 
rameters 6 1 ,..., 0 y arc known, and the transformation from the end effector to the base of the articulated 
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figure °T n (0i, ...,0n) can be computed as 


N\ 


..,0 N ) = °T l (0 l ) 

... (i-1) T.;(0j) • • ' 

■^ n ^T n (0n) = 



m n (0i, ■ 

• •, On) 

77712 (01, - - - , 01V) 

777 i 3 (0i,..., 0jv) 

777l 4 (01, • • 

•, On 

771-21 (01, • 

■ ■ - On) 

777 2 2 (01, • • • , 0 N ) 

77723 (01,...,0jv) 

77724(01, • • 

■, On 

777-31 (01, . 

■ ■ ,0n) 

77732 (01,..., On) 

77733(01,..., 0jv) 

77734 (01, • • 

■, On 

0 

0 

0 

1 



(3.24) 


The elements of the matrix { 777 ^ (0 1 ,..., 0 v)} | i,j = 1,..., 4} can be computed as described in Sec¬ 
tions 2.2 and 2.3. From the transformation °Tjv(0 1 ,. . ., 0jv), the state vector s(0 1 ,..., On) can be 
computed as 


s(0i,... ,0 N ) 


Px(01, . . . ,0 n) 

Py(0l, . . . ,0 N ) 
Pz(0i ,... ,0n) 
4 /( 01 , ■ ■ ■- On) 
@{0i, • • •, On) 
4 >( 0 i , ..., 0 N ) 


(3.25) 


The elements of the state vector arc functions of the matrix elements rriij(0 1 ,... , On) and therefore also 
of the generalized joint parameters 0\, ..., On- The individual elements of the state vector s arc computed 
as described in Section 3.1.4. 

Hence, by specifying all the generalized joint parameters 0, the position p and orientation T, 0, 4> of 
the end effector can be computed by computing the state vector s. 


3.3 Inverse Kinematics 

Inverse kinematics can be explained as follows: given a 3D position and orientation of the link furthest 
away from the base, the end effector determines the parameters of the individual links and joints. 

In forward kinematics, all parameters arc given, and the only real work is to till in all the values and 
calculate the state vector. In contrast, inverse kinematics is more challenging. Given an end effector 
position, there can be numerous valid solutions for the state of the remaining states of the articulated 
figure. This can be verified by a simple test: place your index finger on a hard surface and move your 
elbow. All positions the elbow passes through presents valid solutions of the articulated system (the whole 
arm). 

The first paid of this section describes the theory, shows how to set up an equation system, and how to 
reach valid solutions. Finally, a complete walk through is given using the Denavit-Hartenberg notation. 


3.3.1 Computation of the Joint Parameters 0, 

Assume that one wants to locate the end effector at some goal position p g with some goal orientation 
4> 9 , O g , This results in a goal state vector s g which the end effector should move to. The problem is 


© 


© 
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now how to compute the joint parameters 0,. i = 1,..., N such that the end effector reaches its goal. 
Stated mathematically, it means: determine a vector \6\...., On] 7 such that the function 

f(0 t,•• •, On) = s(6i ,..., On) ~ s g = 0. (3.26) 


We will now study the Taylor expansion of s — s g , but for convenience this will be done using the function 
f. The Taylor series is discussed in detail in Chapter 20. The following sections describes a method that 
uses an iterative process that slowly crawls toward the goal where f is sufficiently close to zero. 

At each step in the iteration scheme presented, one has to calculate a new guess for the joint param¬ 
eters. This is done via a Taylor expansion and the derivative of the state vector seeded with the old joint 
parameter values. These values are then fed to the next iteration. 

In this section, we present the Taylor expansion together with the step equation. The following three 
sections present three special cases for computing the new guess for the joint parameters. Finally, we walk 
through how to compute the derivative matrix of the state vector. 

The Taylor expansion of the function f is equal to 


f{(0i ,. . ■, 0tv)+A(0i, ..., On)) = 

m, ■■■,o N ) + ’""ff A ( g i’- • • > e ^ + t 3 - 27 ) 

o(\\A(0 u ...,0 N )\\ 2 2 ). 

Considering only the first two terms yields 

f{(01, ...,0 N ) + A (0 U ..., 0 N )) « f(0 u ..., 0 N ) + A^t, ■ ■ ■, On). (3.28) 


The goal is that for some A(#i,..., On) the result should be zero, in which case the end effector is at the 
right position and has the right orientation. Therefore, let the new value of f be equal to zero 


f{(0 i, • • •, 0 N ) + A(0i,..., On)) = 0. 
Substituting this into (3.28) yields the following equation 

0 ~ f(0i ,..., 0 N ) + — -A-^A(0i,..., 0 N ) 


<9(0i,..., 0 N ) 


which can also be written 


f(0i, • • •, 0 N ) « ~ 9 qIq 1 1 -^y- A ( 0 t> • • • > On) 


(3.29) 

(3.30) 

(3.31) 


The function f only differs from the function s by a constant s g . Differentiating a constant yields zero, 
so the Jacobian matrix is equal to 


df(0 1 ,...,0 N ) _ ds(0 ll ...,O N ) 
d(0i, ■ ■ ■, On) d(0i,...,0N) 


■e 


© 
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where 


dp z 

fMgi.giv) 

0<T 


;(0i, ■ ■ ■ i On) 

-(0i ,..., 0 N ) 
-(0i ,...,0 N ) 


Therefore (3.31) can now be written 


„(p p \ „ ^ 0 S (01) ••••', 0Af) » /p p \ 

s( 01 , .... C7jvj - ~-^-——A(tyi, . . . , ON). 

0(01, • • • , 0AfJ 

The difference joint pai'ameters vector A(0i,..., 0 y) is equal to 


A(0i,..., 0 at) = 


Qnew _ 


'iv - »ivj 


and we conclude that 


S(01, . . . , 0 N ) - Sg « - 


ds(0t, • • • ,0/ 

9(01,..., e N 


(3.33) 


(3.34) 


(3.35) 


(3.36) 


This equation is derived using only the two first terms of the Taylor expansion of s, and it forms the basis 
of the following derivations. It is only an approximation, but it can be used as an iteration scheme; see 
Section 19, to obtain the desired solution. This will be shown in the following sections. 

3.3.2 The Regular Case 

If the Jacoby matrix ds(6\,, 0iv)/0(0 i, ..., 0,v) of the vector function s is invertible, the matrix 


ds(6 u ... ,e N ) 

0(0i,..., 0 N ) 


(3.37) 


exists, and (3.36) can be rewritten 


ds(0 1 ,...,d N 
0(01,..., d N ) 


\ 

-j (S(0i,...,0jv) - Sg) 


(3.38) 


new 
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Because the matrix 3s(0 1 , ..., 0at)/3(0i,..., 0 n) is a function of (0i,..., 0 n), it must be inverted and 
evaluated at each iteration. 


3.3.3 The Over-Determined Case 

If the Jacoby matrix 3s(0 1 ,..., 0n)/9{0\ ...., 0 y) is over-determined, then it has more rows than 
columns, and it can not be inverted. In this case the pseudoinverse can be used to obtain a least-squares 
solution. To make the notation readable, let 


_ ds{0 1 ,...,0 N ) 

3(0i,... ,0 N ) 

In the over-determined case, the pseudoinverse has the form (see Chapter 19.4) 

J+= {J T J)~ 1 J T . 


Insert J + instead of (3s(0i, ..., 0n)/9(0 1 ,..., On)) 1 in (3.38) yielding 


1 

Qb • 

1_ 


0i 

On 

new 

0 N _ 


J + {S{ 61 , . • .,0 N ) - Sg ). 


(3.39) 


(3.40) 


(3.41) 


The matrix T+ is a function of (0i ,,0n). Therefore, it must be evaluated at each iteration. 


3.3.4 The Under-Determined Case 


If the Jacoby matrix ds(0 1 ,..., 0n)/<)(0\ ,.... 0 \-) is under-determined, it has more columns than rows, 
and is not be invertible. In this case the pseudoinverse can be used to obtain the least-squares solution, 
which has the smallest norm. To make the notation more readable, let 


ds{0 1 ,...,0 N ) 
3(0i,..., 0 jv) 


(3.42) 


In the under-determined case, the pseudoinverse has the form 

J + = J T {JJ T )~\ 


(3.43) 


Inserting J + instead of (3s(0 1 , ..., 0tv)/3(0 i,..., On)) 1 in (3.38) results in the following equation 


0i 


0i 

On 

new 

On 


J + (s{0 1 ,..., On) ~ s g ). 


(3.44) 


Because the matrix J+ is a function of (0i,..., 0..y), it must be evaluated at each iteration. 
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3.3.5 Computing the Jacobian of the State Vector 

In Section 3.3.1 we showed generally how to set up an iteration scheme (see Section 19), to compute the 
joint parameters 0\,.... 9n- The only thing that is missing is how to compute the Jacoby matrix of the 
state vector s. The Jacobian of s is equal to 


ds( 6 1 


3(0i,. 


,0 


N) 


,0 


N 


dpx m 
30? 01 ’ 
d 'P y I n 

d Pz lf} 
d6\ 15 

d6\ 15 

<94* 


0jv) 

On) 

9n) 

On) 

On) 

On) 


dp x 

do N 

°Py 

do N 

OPz 

90 n 

d^> 

n 

90 N 

<94* 

90 N 


(0t,. 


,0 


N) 


( 0 t ,...,0 


(0i,. 

(0i,. 


N) 

■ On) 

■ On) 


( 0 i ,...,0 

( 0 i ,...,0 


N 


N) 


(3.45) 


From (3.22) it can be seen that the transformation °T/v has a special form 


1 T’at(0i, ..., 0 N ) 

= °Ti(0i) • • • (i “ 2) r (i _ 1) (0 ( j_ 1) ) (i - 1 )T l (0 i ) l T (i+1) (0 (i+1) ) • • • ^T n ( 0 n ). 


(3.46) 


When computing the partial derivative of the transformation (l T v(0 1 ,.... 0 y) with respect to some vari¬ 
able 0 i, only one matrix ]i T, depends on the variable 0j. This observation will be used in the following 
to compute the elements of the Jacoby matrix of the state vector. 

Considering partial differentiation with respect to the joint variable 0,, let the matrices P and C be 
given as 


P = °T (i _ 1) (0 1 ,..., 0,:_i) = °Ti(0i) • • • F -2 ) j , .(0 j _i) ( 3.47a) 

C = l T N (0 {i+ 1),..., 0 N ) = l T (i+1) (0 (i+1) ) • • • ^Tn( 0 n ). (3.47b) 

The choice of name P indicates that the matrix contains the transformations of all the links that are 

parents to linki, and the name C indicates that the matrix contains the transformations of the links that 

are children of linki. 

It can be seen that the matrices P and C arc constant with respect to the variable 0,. Therefore, 
parti al differentiation of the matrices with respect to the variable 0, yields zero. With these assumptions, 
the transformation °Tn can be written 

°T N (6 i) = p(0-V Ti (0i)^C (3.48) 


■e 
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which is a product of three matrices where only the one matrix {m )? | i,j = 1,..., 4} is a function of 
joint variable 0, 


pll 

P12 

Pl3 

P\4 


mn(0i) 

m V2 (0i) 

mi 3 (0i) 

mu(0i) 


Cll 

C12 

C13 

C14 

P‘2 1 

P22 

P23 

P24 


m 2 i(0i) 

m 2 2(0i) 

m 23 (0i) 

m 2A (0i) 


C21 

C22 

C23 

C24 

P'S 1 

P32 

P33 

P34 


rn :i i (Oi) 

m 32 {0i) 

m 33 {0i) 

rri34 (Oi) 


C31 

C32 

C33 

C34 

0 

0 

0 

1 


0 

0 

0 

1 


0 

0 

0 

1 


Under these assumptions, the (kj) th element of the matrix °Tjv, denoted (°T v) l: j ■ can be written 


4 4 

0 T N (Oi) kj = ^2^2pkhmhi(Oi)cij (3.50) 

i=i h =t 

and its partial derivative with respect to 0, is equal to 


which is equivalent to 


d{°T N (Pi) kj ) 


1=4 h =4 


1=1 h= 1 


/ dm h i (0 i) 

V m 


c lj 


d{°T N (o,) kJ ) n fd{^T i (e l ))\^ 
00, ~ k * V do, r* j 


(3.51) 


(3.52) 


where P k * denotes the k lh row of matrix P, and C, :j denotes the j th column of matrix C, and 



~dmu(0i) 

dm V2 {0i) 

dm 13 {0i) 

dmu(0i)' 


d(^T,{0 t )) 

do, 

do i 

dm 2 i(0i) 

do, 

dm 22 {0i) 

d0, 

dm 23 (Oi) 

d0: 

dm 24 (Pi) 


dOi 

dm 3 i{0i ) 

d0, 

dm 32 (0i) 

d0, 

dm 33 (0i ) 

dOj 

dm 34 (0,) 

(3.53) 


do, 

dnuiiOi) 

d0, 

dm&XOi ) 

d0, 

dm 43 {0i) 

dOj 

dmu{0i ) 



1 dOi 

dOi 

d0, 

dOi \ 



With (3.53) and either (3.38), (3.41), or (3.44), we can now solve the iterative scheme of (3.36). 

When using the Denavit-Hartenberg description of an articulated figure, each joint is restricted to have 
only one degree of freedom: it has either a translatory parameter d, along its z-axis if it is a prismatic joint. 
or it has one rotational parameter p, around its z-axis if it is a revolute joint. If a joint needs more degrees 
of freedom, several joints with link lengths equal to zero arc located on top of each other. This means that 
the generalized parameter vector 0, = (p r , d, ) 1 becomes an ordinary parameter. For a prismatic joint, the 
generalized parameter 0, becomes d t , and for a revolute joint it becomes p % . 

The following sections show examples of how to compute the Jacoby matrix of a prismatic and a 
revolute joint. 


■e 
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3.3.6 Example Using Denavit-Hartenberg 

Recall that the state vector s is given by 


Px(@i. 
Py (@i, 
Pz{0i, 
*(0i} 

e(0i) 


arctan 


arctan 


m u (6i) 

m 3 4(0i) 

f m 3 2(0j) 

\m 33 (di) 


(3.54) 


m 3 2 (di) sin \U(0j) + m 33 (Pi) cos \&(0j 

m 21 (0i)\ 


arctan 


mn(0i 


where rn^jiOj) is the kj th element of the matrix given by (3.48). The matrix is repeated here for conve- 


T N (e t ) = p(^T t (e t ))c 


(3.55) 


where the matrices P and C are given by (3.47). 

When using the Denavit-Hartenberg notation, the generalized parameter is equal to 6, = 4,] 

and the transformation matrix ( 1 ! T, (0,) is given as 


cos ipi — sin(^j 0 di -1 

{i-P)T(d- • ) = COSQ 7-i sin ^i cos «i_i cos (pi -sin«j_i -di sina^-i 

i \ nVi) s j n ai-i sin (pi sin i cos tpi cos a\ d, t cos a i 


(3.56) 


The Jacobian of the state vector s is given as 

r dp x 




ds(90 N ) = d6i 

d(0\ . 9 S ) <&_ 




f/ < f > 


^. 

^. 

. 

“4 # 

a^ (e ‘. 9w 

. 


where the kj th entry is the partial derivative with respect to the variable 0 k 

d(°T N (0i)) kj _ n fd{^T i (e i ))\^ 

86 i k *\ 86 i J * J 


(3.57) 


(3.58) 
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The matrices P and C arc given by (3.47). 

In the following sections, the matrix di^^T^Oi)) / dOi is computed for both a prismatic and a 
revolute joint. 


3.3.6.1 A Prismatic Joint 

The general transformation matrix 11 T, is given by (3.56). For a prismatic joint, the generalized joint 
parameter 0, is equal to 0, = d,. Therefore, the partial derivative of - l l: 'T, given by the matrix (3.56) 
reduces to 

dp-VTijOj)) 

ddi 


0 0 0 0 

0 0 0 — sinai_i 

0 0 0 cosctj—i 

0 0 0 0 


(3.59) 


The State Variable p(di) The state variable p is given by (3.11) as 


P{di) 


Px 


m u 


■(°T N (d l )) u 

Py 

= 

m 24 

= 

{°T N (di)) 24 

Pz_ 


m 3 4_ 


{°T N (di)) 34 


(3.60) 


Computing the transformation matrix °T^{di) using (3.55) and (3.56) yields the following expression for 
the location vector p 

p x (di) = ai-ipn T Pi 4 ~ P12C34 sinaj_i - dip^sina^ 

+ (pncu + P 13 C 24 sin ccj-i) cos p t - p u c 2 4 sin ^ + P 13 C 14 sin a,_i sin ip t 
+ (pi 3 (C 34 + di) + P 12 C 24 cos ipi + P 12 C 14 sin </?,) cos aj_i (3.61a) 

Pyidi) = ai-ip 2 i +P24, ~ F22C34 sinaj -1 - dip 2 2 sin 1 

+ (P21C14 + P23C24 sin aj_i) cos - P21C24 sin ip t + P23C14 sin 1 sin ip t 
+ (P23 (C 34 + d t ) + P 22 C 24 cos ipi + P 22 C 14 sin ifi) cos ai-i (3.61b) 

Pz(di) = a;—1P31 + PM ~ P32C34 sin ati-1 - diP 32 sin 1 

+ O 31 C 14 + F 33 C 24 sin a t - 1 ) cos c pi - p 3 ic 24 sin c pi + p 33 c 14 sin a t -\ sin p l 
+ (P33 (c 3 4 + di) + p 32 C 24 cos <Pi + p 32 Ci 4 sin <fii) COS CXi-i. (3.61c) 


The partial derivative of the location p with respect to the variable di is equal to 



~dp x (di)~ 


rd(°T N (d t )) u -] 


Pu( 

^d^-d Ti (di))\^ ' 


ddi 


ddi 


V ddi ) 

Or 
Ca *53 

II 

dp y (di ) 
ddi 

= 

d{°T N (di)) 2 4 
ddi 

= 

P2*( 

fd^Ti(di))\^ 

\ ddi ) * 4 


dp z \d>i) 

- ddi - 


d{°T N (di)) 3 4 

ddi 


P 3 * ( 

fd(( i -d Tl (d. l ))\^ 

v ddi ) * 4 \ 


(3.62) 


© 


© 


© 
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where the matrices P and C are given by (3.47), and the matrix l)(' t ^ l) T l (d l )) /ddi is given by (3.59). 
Multiplying the matrices yields the following expressions for the partial derivative dp/ddi of the state 
variable p with respect to the joint parameter <7* 


dp x (dj ) 
ddi 

dpyjdj) 

ddi 

dpzjdj) 

ddi 


= p i 3 cos a t -\ - pi 2 sin ai-i 
= P 23 cos a t -\ - p 22 sin a*_i 
= p 33 cos ai- i - p 32 sin i. 


(3.63a) 

(3.63b) 

(3.63c) 


The Yaw Angle 4' (d, ) The state variable 4/ is given by (3.12) as 

4 f (dj) = arctan /^— 32 j (3.64) 

\m 33 (di) J 

and its partial derivative with respect to the variable d t is equal to 


= A arctan ( 

ddi ddi ‘ \m 33 (di)J 


1 _ 9 f m 32 {di) \ 

f m 32 (di ) \ 2 ddi \m 33 (di) J 

\m 33 (di)J 


1 

f m 32 {dj) 
\m 33 (d l ) 

1 

f m 3 2(dj) 

\m 33 (di) 


dm 32 (dj) ,dm 33 {di 

^33 \fli) TU 3 2 \fli 


ddi 


m, 33 (di 


dm 3 2 (di 


m. 33 (di 


ddi 


m 3 2(dj) dm 33 (dj) \ 
m 33 (di) 2 ddi J 


(3.65a) 

(3.65b) 


(3.65c) 


= 0 (3.65d) 


The elements m/. ; ( d t ) are elements of the matrix °T n , which are given by (3.55) and (3.56). Computing 
°Tjv yields the following expressions for the elements m 32 ((/,;) and m 33 {di) 


m 32 (di) = -p 3 2C 32 sinai_i + (p 3 ic 12 + p 33 c 2 2 sina^ijcos p % 

~ P31C22 sin Pi + p 33 ci 2 sin a,;_ i sin p l 

+ (P33C32 + P 32 C 22 cos Pi + P32C12 sin Pi) cos a;_i (3.66a) 

m 33 {di) = -p 3 2 C 33 sinai_i + (p 31 c 13 + p 33 c 23 sina ? _i) cos p t 
- P 31 C 23 sin pi + P 33 C 13 sin a*_ 1 sin p t 
+ (P33C33 + P32C23 COS pi + P 32 C 13 sin p t ) COS 1 


(3.66b) 
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The elements dm^jid/) / ddi arc given by 

dm kj _ ( dp-VTjjdi)) 
ddi k * \ ddi 


c 


*3 


(3.67) 


where the matrices P and C arc given by (3.47), and the matrix 0 ('' 1 ! T l (di)'j/ddi is given by (3.59). 
This yields 


dm 32 (di 

ddi 

dm 33 (dj 

ddi 



(3.68a) 


(3.68b) 

equal to zero 


3*(di) n 

(3.69) 


The Pitch Angle 0(dj) The state variable 0 is given by (3.15) as 


@(di) = arc tan f 
The expression for 0 has the form 


~m 3 1 (di 


\m 32 (di) sin 4/(di) + m 33 (di) cos T'(di) ) ' 
N(d, 


@(di) = arctan 


D(di 


where the numerator N and the denominator I) are equal to 


N(di) = -m 3 i (dj) 

D(di) = m 32 (di) sin (dj)+ m 33 (dj) cos T'(dj) 


(3.70) 

(3.71) 

(3.72a) 

(3.72b) 


where the angle 4i is given by (3.64) and (3.66). 

The elements mkj(di) are elements of the matrix (l T ; .y, which arc given by (3.55) and (3.56). Com¬ 
puting °T ; y yields the following expressions for the elements 771.31 (di), m 32 {di), and m 33 {di) 

77131 (di) = -P32C31 sin «i_i + (f> 3 icn +P33C21 sinai_i) cos^i 

- P31C21 sin tfi + P33C11 sin <+_! sin ipi 

+ (P33C31 + P32C21 cos ipi + p 32 cu sin w) cos <+_ 1 (3.73a) 

77132(dj) = —p 32 c 32 sin «i_i + (p 3 ici2 +p 33 c 22 s\na.i-i) cos (pi 
~ P31C22 sin tfi + P33C12 sin 1 sin ^ 

+ (T33C32 + T32C22 COS ^i + P32C12 sin ifi) cos <+_! (3.73b) 

77133 (dj) = —T32C33 sin cii_i + (p 3 ici3 +p 33 C23sinai_i)cos^i 

- P31C23 sin + P33C13 sin a*_i sin 

+ (P33C33 + P 32 C 23 cos v?i + P32C13 sin ^j) COS «i_l ( 3 . 73 c) 
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Using this simplified notation, the partial derivative of 0 with respect to the joint variable dj can be 
written 


dO(di) 

ddi 


i + 


l 

N{di) 

D(di) 


2 


_d_ ( N(dj) \ 
ddi \D(di)J 


D(di 


dNjdj 

ddi 


~N{di 


dD{dj 
* ddi 


1 + 


( Njdi, 

\D{di 


D{di 


1 


1 + 


Njdj) 

D{di) 


2 


( 1 dNjdj) 
\D{di) ddi 


N(di ) dDidi)\ 
DidiY ddi ) 


(3.74a) 


(3.74b) 


(3.74c) 


where expressions for N and D have already been derived in (3.72). Expressions for dN (di) / ddi and 
dD{di)/ddi will be derived in the following 


dNjdj) 

ddi 

dDjdj) 

ddi 


+ 


dm 31 jdj) 

ddi 

inwidi) sin 4/(dj) + m 33 (di) cos ^(dj) 
sin ^(di) + m 32 idi ) cos 

dm 33 (di 


ddi 


cos T 1 id, ) - m 33 (di ) sin 'E (di 


ddi 

d*(dj) 

ddi 


The elements dm^jidi)/ddi are given by 


dm kj _ ( d^Tijdi)) 

ddi k * V ddi 


C 


*3 


(3.75a) 

(3.75b) 

(3.75c) 

(3.75d) 

(3.76) 


where the matrices P and C are given by (3.47), and the matrix d (■ 1 ^Tj(di)) /ddi is given by (3.59). 

This yields 


dm 31 (dj) ( dp-VTjjdj)) 

ddi 3 * \ ddi 


C*i = 0 


(3.77) 


from which it follows that the partial derivative of the numerator dN(di) / ddi is equal to zero. In (3.65d) 
it was shown that the partial derivative 0 X V (dj) / 0d r is equal to zero, and (3.68) shows that the partial 
derivatives dm 3 i/ddi and dm 33 /ddi arc also equal to zero. Therefore, the partial derivative dP(di)/ddi 
is equal to zero 


dm 31 (dj) _ dm 32 (di) _ dm 33 (dj) _ d^jdj) _ dNjdj) _ dP(d t ) 

ddi ddi ddi ddi ddi ddi 


© 


© 


© 


© 
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from which it can be seen that (3.74c) is equal to zero 

dN(di) _ dD(di) _ n 


dOjdj 

ddi 


(3.79) 


The Roll Angle The state variable 4> is given by (3.18) as 


4>((Zj) = arctan 


m 2 i{di) 

mn(di) 


(3.80) 


and its partial derivative with respect to the variable p t is equal to 


d$(di) _ d + (m 2 i{di 

ddi ~ ddi aiC ai \m n (di 


_L_ d f m 2 i{di) 

m 2 i(di )\ 2 ddi \mn(di) 
m n (di) J 


m 21 (dj 

m n (di 

1 

m 2 i(di 

m n (di 


dm 2 i(di) dm u (di 

mu(di) -—- m 2 i(di) -—— 

_ ddi _ ddi 

m n (c/j) 2 


_ ( 1 dm 2 i{dj) _ m 2 i(dj) dmn(dj 

2 \mn(di) ddi mii(dj) 2 ddi 


(3.81a) 


(3.81b) 


(3.81c) 


(3.81d) 


The elements rrikj(di) arc elements of the matrix °T]\r, which arc given by (3.55) and (3.56). 
°T/v yields the following expressions for the elements rn 2 \ (d,j) and m\ \ (d r ) 

mn(di) = -P12C31 sinai_i + (pucn + P13C21 sinaj_i) cos <pi 
~ P11C21 sin (fi + P13C11 sin a*_ 1 sin p % 

+ (P 13 C 31 + P 12 C 21 COS ifi + P 12 C 11 sin tfi) cos 1 
m 2 i(di) = -p 22 c 3 i sinai_i + (p 2 icn +P 23 C 21 sina, ; _i) cos tpi 
~ P 21 O 21 sin ^ + p 23 cn sin a*_ 1 sin p t 
+ (P 23 C 31 + P 22 C 21 COS Pi + p 22 cn sin Pi) cos a *-1 


Computing 


(3.82a) 


(3.82b) 


The elements dmkj(di)/ddi are given by 


ddi A ' l ddi ) 


(3.83) 
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where the matrices P and C arc given by (3.47), and the matrix d ('' l) T l (d l ) 'j /ddi is given by (3.86). 
This yields 


dmn(di) 

ddi 

dm 2 i(dj) 

ddi 



(3.84a) 


(3.84b) 


from which it can be seen that (3.8 Id) is equal to zero 

d${dj) 

ddi 


(3.85) 


3.3.6.2 A Revolute Joint 


The general transformation matrix ' z ~ 11 T, is given by (3.56). For a revolute joint, the generalized joint 
parameter 6, is equal to 0, = p it Therefore, the partial derivative of given by the matrix (3.56) 

reduces to 


dp-VTijOj)) 

ddi 


— sin p>i — cos (pi 0 0 

cos ccj-i cos (pi — cos a*_i siny?j 0 0 
sinccj-i cos <pi — sinaj-i sin</5j 0 0 

0 0 0 0 


(3.86) 


The State Variable p(di) The state variable p is given by (3.11) as 


P{Pi) 


Px 


m u 


\ 0 T N (pi)) u 

Py 

= 

m 24 

= 

C °T N (Pi )) 24 

Pz_ 


_m 34 _ 


(°T N (pi)) 34 


(3.87) 


Computing the transformation matrix °Tjv(<F«) using (3.55) and (3.56) yields the following expression 
for the location vector p 


Px(<Pi) = CLi-ipu +P14 — P12C34 sin <Xi-i - diPi -2 sin a*_i 

+ (Piicu + P13C24 sin ai- 1) cos pi - P11C24 sin p t + P13C14 sin «j_i sin p % 

+ (p \3 (c 3 4 + di) + P12C24 cos pi + P12C14 sin p/) cos 1 ( 3 . 88 a) 

p y (pi) = ai-ip 2 i +P2A -p22C34sin«j_i - diP22 sin a*_i 

+ (P21C14 + P23C24 sin 1) cos pi - P21C24 sin Pi + P23C14 sin «i_i sin p t 
+ (p23 (C34 + di)+ P22C24 cos pi + P22C14, sin p t ) cos a t -\ (3.88b) 

p z (pi) = CLi-iPzi +P 34 — P32C34 sin Clj-l - diP 32 SillQ;j_i 

+ (P31C14 + P33C24 sill ai- 1) cos Pi - P31C24 sin p t + p 33 c 14 sin 1 sin p t 
+ (P33 (C34 + di) + P32C24 cos p t + p 32 c 14 sin pi) COS «j_i (3.88c) 
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The partial derivative of the location p with respect to the variable is equal to 


9p(<fi) 

dtfi 


~dp x (<pi)~ 


rd{°T N (pi)) u i 

difi 


ddi 

dPy(<Pi) 


d(°T N (pi)) 24 

d<Pi 


ddi 

UPz vPi) 


cc 

""5; 

h 

O 

<5^ 

- dw . 


L ddi J 


Pi* 

Pi* 

P 3 * 


dp-VTifri)) 

dw 

dp-VTiitpi)) 

d<pi 

d<pi 


c 


*4 


c 


*4 


c 


*4 


(3.89) 


where the matrices P and C are given by (3.47), and the matrix <9 (<£>*)) jdcpt is given by (3.86). 

Multiplying the matrices yields the following expressions for the partial derivative dp/dip i of the state 
variable p with respect to the joint parameter ipi 


dpx(fi) 

difi 


dPyiVi) 

dpi 


dpz{<Pi) 

dipi 


(-P11C24 + P12C14 cos ai -1 + P13C14 sin CKj_i) cos 
- (piicu +pi 2 C 24 Cosaj_i +pi 3 C24sina i _i)sin( / ?j 
(-T21C24 + T22C14 COS ai- 1 + P23C14 sin 1) cos 
~ (P21C14 + P22C24 COS ai- 1 + P23C24 sin ok*_i) sin 
(-P 31 C 24 + P 32 C 14 COS + P33C14 sin CKj_i) COS Ifi 

~ (P 31 C 14 + P 32 C 24 cos ccj_i + P33C24 sin ccj-i) sin (fii 


(3.90a) 


(3.90b) 


(3.90c) 


The Yaw Angle \k(<£>j) The state variable T is given by (3.12) as 


'&(pi) = arctan 


m 32 (pi) 

m 33 (<pi) 


(3.91) 
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and its partial derivative with respect to the variable pi is equal to 

d*(<Pi 


d<pi 


9 , (m 32 (pi)\ 

—— arctan --—- 

uipi \ ^33 yPi ) J 

d ( m, 32 (<pi 


1 


1 + 


f m 32 (<pi) y d<pj \m 33 (ipi) 

\m 33 ((pi)^ 


(3.92a) 

(3.92b) 


.dm 32 (<pi) , x dm 33 (<pi 

^33 vPi) o ^32 yPi 


dcpi 


dipj 


1 + 


/ m 32 (p 
\rn33 (ip, 

1 


1 + 


f m 32 (ipi 
\m 33 (ipi 


m 33 (<pi) 2 

dm 32 ((pi) m 32 (ipi) dm 33 (ipi 


)\ 2 \m 33 (pi) d(fi 


m 33 (ipi 


dpi 


(3.92c) 


(3.92d) 


The elements arc elements of the matrix °T n , which are given by (3.55) and (3.56). Computing 

°Ttv yields the following expressions for the elements m 32 (p t ) and ui 33 Up r ) 

m 32 (Vi) = -p 3 2 C 32 sinai-i + (p 3 ic 12 + p 33 c 2 2 sina*_i) cos pi 
~ P31C22 sin Pi + p 33 C !2 sin a*_i sin p l 

+ (T 33 C 32 + P 32 C 22 cos pi + p 32 ci 2 sin cos «»_! (3.93a) 

m 33 (pi) = -P32C33 sin ai-i + {p 3 ic 13 + P33C23 sin a,_ 1 ) cos p t 
- P 31 C 23 sin pi + P 33 C 13 sin a,;_ i sin p t 

+ (T33C33 + P32C23 cos Pi + P32C13 sin Pi) COS 1 ( 3 . 93 b) 

The elements dmkj(pi) I dp i are given by 


dm 


kj 


dpi 


= P 


k* 


d^Ti(pi)) 

dpi 


c 


*3 


(3.94) 


where the matrices P and C are given by (3.47), and the matrix 0 ( 1 ' l Ti(pi)')/dpi is given by (3.86). 
This yields 

'd{^T t (pi)) 


dm 32 (pi 

dpi 


dm 33 (pi 


= P 


3 * 


dpi 


c 


*2 


= (-P 31 C 22 + P 32 C 12 COS 1 + P33C12 Sm CTj-i) COS Pi 

- {P 31 C 12 + P32C22 cos a.i -1 + P33C22 sin «j_i) sin p % 

'd^Ti(pi)) 


dpi 


= p 


3 * 


dpi 


c 


*3 


= (-P31C23 + P32C13 COS 1 + P33C13 sin Oii- 1) COS Pi 

~ (P31C13 T P32C23 cos oti—i + P33C23 sin a-i-i) sin<^ 


(3.95a) 

(3.95b) 

(3.95c) 

(3.95d) 
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The Pitch Angle 0 (pi) The state variable 0 is given by (3.15) as 


&(pi) = arctan( 
The expression for 0 has the form 




\m 32 (pi) sin V(<fii) + m 33 (<pi) cos&(<pi) 


O(pi) = arctan 

where the numerator N and the denominator D arc equal to 


( N{p 

\D(<P: 


N(ipi) = —m 3 i(<pi) 

D(ipi) = m 32 (pi) sin ^(pi) + m 33 (pi) cos ^(pi) 


(3.96) 

(3.97) 

(3.98a) 

(3.98b) 


where the angle T is given by (3.91) and (3.93). 

The elements rrikj(pi) arc elements of the matrix °T ; .y. which arc given by (3.55) and (3.56). Com¬ 
puting "TV yields the following expressions for the elements rn 3 \ {pi), m 32 (pi), and m 33 {pi ) 

m 3 i(cpi) = -p 32 c 3 i sinaj-i + (p 3 ic u + p 33 c 2 i sina 8 _i) cos pi 

- P31C21 sin tpi + p 33 cii sin a ? _i sin p l 

+ (T33C31 + T32C21 cos pi + p 32 cu sin cos a;_i ( 3 . 99 a) 

m 32 (Pi) = -P32C32 sin 1 + (p 31 c 12 + P33C22 sin a*_ 1) cos p t 
~ P31C22 sin pi + P33C12 sin a*_ 1 sin pi 

+ (T33C32 + P32C22 cos Pi + P32C12 sin y?;) cos a;_i ( 3 . 99 b) 

^33 (</>*) = -P 32 C 33 sin ctj-i + (p 3 ici3 + P33C23 sin a,_i) cos p t 

- P31C23 sin pi + P33C13 sin a t -\ sin pi 

+ (P33C33 + P 32 C 23 COS Pi + P32C13 Sm pi) COS CCj-l ( 3 . 99 c) 


Using this simplified notation, the partial derivative of 0 with respect to the joint variable pi can be 
written 


d&(pi) 

dpi 


1 + 


1 

N{pi) 
D{pi) 


2 


d ( N(pi) \ 

dpi \D(pi)J 


1 + 


1 

N{pi) 

D(pi ) 


2 


D{pi 


dN(pj 

dpi 


— N(pi) 


D{pi 


dD{pi ) 
dpi 


1 


1 + 


N(pi) 

D{Vi) 


2 


( 1 dN(pj) _ N(pj ) dP(pj) \ 

\D(Pi) dpi D(pi) 2 dpi ) 


(3.100a) 


(3.100b) 


(3.100c) 
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where expressions for N and D have already been derived in (3.98). Expressions for dN{pi) / dpi and 
dD(tpi)/d(pi will be derived in the following 


dN(ipj 

d<Pi 

dD(<Pi 

dipi 


_ dm 31 (<pi) 

dipt 

= m 32 (pi)sm^>(pi) + m 33 (pi) cos ^(pi) 

- + m 32 (<Pi) cos if/ 


+ 


dip t 
dm 33 {pj) 
dipt 


cos 4 '(pi) - m 33 (<pi) sin 


dpi 

&HPi, 

dpi 


The elements dmkj(pi)/dipi are given by 


dm 


kj 


dpi 


= P 


k* 


dp-VTifri)) 

dpi 


c 


*3 


(3.101a) 

(3.101b) 

(3.101c) 

(3.101d) 


(3.102) 


where the matrices P and C are given by (3.47), and the matrix 0 (■' 1 1 T l (p l )') /dpi is given by (3.86). 
This yields 


dm 31 ( Vi ) „ fdp-VTifa)) ^ 

3* \ o / ^*1 


■i \ dpi 

= (-P31C21 + P32C11 cos «i_i + p 33 cn sin i) cos p t 
- (P3icn +P 32 C 21 cos«j_i + p 33 C 2 i sin aj_i) sin pi 


dm 3 2 (pi 


= P 


r\ •* 3* l r\ 

0<Pi \ 

= (-P31C22 + P32C12 COS «j_i + p 33 C 12 sin i) COS <£>j 
- (P31C12 + T32C22 cos «j_i + p 33 c 22 sin OLi- 1 ) sin p t 


d^-VTiipi)) 


C 


*2 


dm 33 (pi 


= P 


r\ •* 3* l n 

= (-P 31 C 23 + P 32 C 13 COS «j_i + p 33 Ci 3 sin Oj-i) COS <£>j 
- (P31C13 +P 32 C 23 Cosaj_i + p 33 c 2 3 sin «i_i) sin pi 


d^T^)) 


c 


*3 


(3.103a) 

(3.103b) 

(3.103c) 

(3.103d) 

(3.103e) 

(3.103f) 


The Roll Angle &(pi) The state vaiiable 4> is given by (3.18) as 


4fe) = arctaaO^ 

\mn(p 


(3.104) 
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and its partial derivative with respect to the variable d, is equal to 

dipi oipi \mn{pi)J 


1 


8 (m 2 i(pi) 


1 + 


m 2 i(<Pi )\ 2 dl Pi V m n (<Pi) 


m n (<Pi 

1 


(3.105a) 

(3.105b) 


.dm 2 i{ipi) , dm u (tpi 

mn{(pi) rs ^21 vPij 


dpi 


dpi 


1 + 


m.2i{(pi 

m u (<Pi 

1 


1 + 


m u (<pi) 2 

dm 2 i(ipi) m2i(<Pi) dm n (<Pi 


rn.2i{tpi)\ 2 \mn{pi) dpi mu(pi) 2 dpi 

m n (<Pi 


(3.105c) 


(3.105d) 


The elements m/. y (pj) arc elements of the matrix °T n , which are given by ( 3 . 55 ) and ( 3 . 56 ). Computing 
°T/v yields the following expressions for the elements m 2 i (pi) and mn(^) 

mu (Pi) = -P12C31 sin oti-i + (jpucu + pi 3 c 2 i sina*_i) cos^; 

- P11C21 sin ifi + P13C11 sin a*_ 1 sin p t 

+ (P13C31 + P12C21 cos pi + P12C11 sin cos «*_! ( 3 . 106 a) 

m 2 i(pi) = -P22C31 sinctj-i + (p- 2 icn +P23C21 sinc^-i) cos p t 

- P21C21 sin + P23C11 sin a,;_ i sin p t 

+ (P23C31 + P22C21 cos pi + P22C11 sin pi) cos a;_i ( 3 . 106 b) 

The elements dnikj(pi) I dp i are given by 


dm 


kj 


dpi 


= P 


k* 


d^Ti(pi)) 

dpi 


c 


*3 


(3.107) 


where the matrices P and C arc given by (3.47), and the matrix 0 ( 1 ' l Ti(pi)')/dpi is given by (3.86). 
This yields 

'd(^Ti(pi)) 


dm u (pi 
dpi 


dm 2 i (Pi 

dpi 


= P 


1 * 


dpi 


c*i 


= (-P 11 C 21 +P 12 C 11 cosai -1 +P 13 C 11 sin oti-i) cospi 
- {piiCu +P 12 C 21 cosa.i -1 +P 13 C 21 sin«j_i) sin^ 


= P 


2 * 


d(^T t (pj)) 

dpi 


c*i 


= (-P21C21 +P22C11 cosai-1 + P23C11 sin ati- 1) cos pi 
- (P21C11 +P22C21 cos aii-i +P23C21 sin«j_i) sin p t 


(3.108a) 

(3.108b) 

(3.108c) 

(3.108d) 
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3.3.6.3 Summary 

From the above it can be seen that, for a prismatic joint, the partial derivatives of the state variables T. 0. 4> 
with respect to the joint parameter di all vanish, that is, 


<9'P dO <94) 
<9 di ddi ddi 


(3.109) 


This should come as no surprise, because the state variables 4), 0, $ describe the orientation of the end 
effector. A prismatic joint performs only translation, and a translation can only change the position of the 
end effector, but it can not change its orientation. 
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Motion Interpolation 


An animation can be defined as “a motion picture made by photographing successive positions of inani¬ 
mate objects (as puppets or mechanical parts).” The first animations were drawn by hand and then pho¬ 
tographed onto a film strip. In this way, Walt Disney made Mickey Mouse come to life. It is a very 
time-consuming task to draw all the frames by hand, and in the beginning of this industry, cartoon films 
only had twelve frames per second as opposed to the 25-30 frames per second of modern films. A time 
and money saver was the use of in-betweeners. An in-betweener was a less-known artist whose job was 
to till in the blanks left by the famous artists. So the expensive artist would draw up Donald Duck in 
the meaningful poses (key frames), and the in-betweeners would draw all the in-between less important 
frames, thereby making sure that Donald Duck was animated from key-frame to key-frame. 

Today computers arc the in-betweeners. The animation artist places the character or object in the 
key positions and tells the system when the model should be in a specific position. Then the computer 
calculates the positions of the model at each frame, renders it, and saves it. 

In the previous chapters, we showed how to model a complex model and how to help the computer 
calculate specific positions for each object in the model. That is just one example of an automated calcula¬ 
tion. One could also use a simulator to estimate the position and shape of the model. The most basic way 
to animate is to use forward kinematics and no automation, just like the old Disney animators did. The 
focus of this chapter is how to calculate the frames without a simulator or an automated system—although 
they arc often used in conjunction with each other. 

The most basic way to calculate the in-between frames is to interpolate the storytelling poses, the 
key-frames. Simple lineal - interpolation results in a jerky result. The character will walk like a stiff robot, 
or if one is animating a color, it will flicker. 

The first part of this chapter explores how to use higher-order splines instead of simple linear interpo¬ 
lation so as to preserve the continuity over the key-frames and hide the key-frames. The parameter of the 
interpolating curve has no physical meaning. This can present some difficulty to the animator. A more 
intuitive way to think about the interpolating parameter is by using time or traveled distance along the 
spline. The latter part of this chapter shows how to reparameterize the splines so they become a motion 
function of time instead of the spline parameter and have some sort of physical meaning. 

4.1 Key-Framing 

In this section we will show how forward kinematics can be used to animate articulated bodies. It has just 
been shown how the state vector s specifies the position and orientation of the end effector given values 
for the generalized joint parameters 0 \ ,.... t) y. By varying the joint parameters, the articulated figure 
will move. So, by letting the joint parameters be functions of time t, that is, 0 \ (t). : ... . the state 

vector will also be a function of time, s(t), and the articulated figure can be animated by stepping the time 
parameter t, and computing the state vector s(t). 
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4.1.1 Key-Frames 

In practice, it is not easy for an animator to specify all the joint parameters as functions of time, that is, 
0 i ,..., On- A helpful tool is to specify the joint parameters by using what is known as key-frames or the 
storytelling poses. The animator only has to specify the most important configurations of the articulated 
figure and also at what time t, they occur, see Figure 4.1. Having specified the key-frames, that is, the 
joint parameters 0 , at some discrete times the joint parameters at the key-frames can be plotted on a 
timeline, see Figure 4.2. The idea behind using key-frames is that the animator should not specify a huge 
number of joint parameters, but only the ones that arc important for the animation. The joint parameters 
that arc in between the key-frames should be calculated automatically such that the animation looks like 
the animator imagined when he specified the key-frames. 

The only thing that is known is the joint parameters at the key-frames, but the joint parameters between 
the key-frames are not known, so this suggests an interpolation between the joint parameters at the key- 
frames. 

4.1.2 Linear Interpolation 

The first idea that comes to mind is linear interpolation. If linear interpolation is used, the functions that 
represent the joint parameters will look like the ones shown in Figure 4.3. As it can be seen from the 
figure, the joint parameters make some very abrupt changes at some of the key-frames. The functions for 
the joint parameters arc continuous, but not differentiable. Especially, it can be seen for joint parameter 
Oq at the key-frame at time t\. The joint parameter 0 O varies linearly from 45° to 90°, and then it abruptly 
changes and returns to 45 °. The same can be seen at time t 2 where the joint parameter 9q varies linearly 
from 90 ° to 45 ° and then it stops immediately. 

At the key-frames, the function is continuous but not differentiable, and that might look unnatural to 
the beholder of the animation. Worse, if the joint parameter functions of the animation was to be used for 
an actual industrial robot, the forces at time t \ might be so big that the robot would break. 

4.1.3 Spline Interpolation 

As it was discussed in Section 4.1.2 linear interpolation is neither adequate for animation nor for sim¬ 
ulation of industrial robots. The reason is that the functions representing the joint parameters must be 
differentiable. Otherwise, the animation will look unnatural or the industrial robot will break. 

The idea is now to make a continuous differentiable curve, which passes through the joint parameters 
at the key-frames, that is, the curve must be continuously differentiable and also pass through the points 
specified by the joint parameters at the key-frames. 

To do this job there is a family of curves named interpolating splines. So, in order to get a smooth 
function for the joint parameters, we use the parameters at the key-frames as points that must be interpo¬ 
lated and use a spline to interpolate them. A result of such a spline-interpolation is shown in Figure 4.4. 
At first, it seems strange that the curves of the joint parameters have some strange bends. They are nec¬ 
essary because the curves must be continuously differentiable. How much the spline bends and wiggles 
depends on which type of spline and how many control points arc used for controlling the spline, see 
Figure 4.5, where there arc used as extra control points for the spline. Given access to an interactive tool 
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(a)i 0 : do = 45°, 01 = -135° (b) ii : 0 O = 90°,0i = - 



(c )t 2 : 0 O = 45 °, 6 »i = - 45 ° (d) t 3 : 0 O = 45 °, 0 i 

Figure 4.1: The configuration of an articulated figure at times to, 
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Figure 4.4: The functions of the joint parameters arc interpolated between the joint parameters at the key- 
frames by a smooth interpolating spline. The control polygons used arc those from the linear interpolation 
shown in Figure 4.3, i.e., just the known joint parameters at the key-frames. The control polygons arc 
shown as dashed lines. 


0i 



Figure 4.5: The functions of the joint parameters arc interpolated between the joint parameters at the key- 
frames by a smooth interpolating spline. Here, extra control points arc used to control the spline, which 
gives a more smooth curve without wiggles. The control polygons arc shown as dashed lines. 
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where it is possible to choose the type of interpolating spline and to adjust the chosen splines parameters, 
an experienced animator might obtain very visually pleasing results. 

However, using spline interpolation introduces a little inconvenience for the animator. Looking at 
Figures 4.4 and 4.5 the curves for the joint parameters 9 1 and 62 look as if they arc functions of time t, but 
they arc not. The curves arc given as a vector function (t ( u). 0 l (u)) T , where the coordinate functions arc 
functions of the spline parameter u, which is not necessarily intuitive for an animator. 

That means if the animator wants to compute the value of a joint parameter 0, at some given time t, 
this must be done in two steps: 

1. Compute the spline parameter u corresponding to the time t. This can be done numerically as a 
root-finding problem, and because the function tiu) is a monotone increasing function, there is a 
unique solution. That is, find u such that ||/(u) — 1 1| < e using any root-finding algorithm. 

2. Compute the joint parameter 0, corresponding to the spline parameter u, i.e., 0 t (u). 

4.2 Scripted Motion Using Splines 

Imagine the following scenario: an elevator is riding up and down between the first and fifth floor in a 
house. Inside the elevator is a person who throws a ping-pong ball against the elevator floor and catches 
it when it pops back up. When the ping-pong ball and the elevator floor collide, the ball is affected very 
much by the collision, but the elevator is not. The elevator will continue to move as if the ball had not hit 
the floor. 

To animate such a scenario on a computer, it would be a waste of computer power to animate both 
the elevator and the ping-pong ball as physically correct. Only the ping-pong ball should be animated 
physically correct. The motion of the elevator could be modeled by a 3D curve, e.g., a spline, describing 
the location of the elevator during the animation. 

A rigid body whose motion is described this way is called a scripted body. Modeling a scripted body 
as described above might be done by key-frames analogous to the description in Section 4.1. Specify some 
3D positions the scripted body must pass through, and then compute an interpolating spline to describe 
the trajectory of the scripted body. 

4.2.1 The Basic Idea 

The trajectory is a 3D curve C(u) = [x(v.) y(u ) z(u)\ 7 parameterized by the spline parameter u, and 

it has the same inconveniences as described in Section 4.1.3. The spline is parameterized by u and not 
time t. 

Animation is about how a point p moves in time. More specifically, it can be stated: 

1. What is the location of point p at time /? 

2. What is the velocity v of point p at time /? 

3. What is the acceleration a of point p at time /? 

In order to make it easier for an animator to control the scripted body, the spline C can be reparame- 


© 


© 


© 


© 



“book” — 2005/9/30 — 15:44 — page 77 — #89 


4.2 PHYSICS-BASED ANIMATION 


77 


terized such that it becomes a function of time t. 

This reparameterization can be done in two steps. First, express the spline C as a function of the arc 
length s, and then express the arc length s as a function of time t. 


1. Express the spline C as a function the arc length s. That is, construct a function U : M —» M 
which maps the arc length to the corresponding spline parameter u. 

2. Express the arc length s as a function of time t. That is, construct a function S : M —» M which 
maps time t to arc length s. 

Using the above reparameterizing, the original spline C (u) can be expressed as a function of time t 


C(U(S(t))) i ► p = 


x(u(sm 

y(u(S(t))) 

z(u(sm 


(4.1) 


This means that an animator can control the position p, the velocity v, and the acceleration a as shown 
below 


P(t) 

v(t) 

a{t) 


C(U(S(t ))) 

dt dt v du d-s dt 


d 2 p(t ) d 2 


dt 2 

d 2 C 

du 2 


dt2 < cmsm) - j t 


dC_dUdS_ 
du ds dt 


dUY 

ds ) 


fdS_ 

I dt 


d(JdUfS_ dC d 2 U 
du ds dt 2 du ds 2 



Some of the derivatives 


dC 

du 


and 


d 2 C 
du 2 


(4.2a) 

(4.2b) 

(4.2c) 
(4.2d) 

(4.3) 


arc easy to compute, because they arc given directly by the definition of the original spline C. The symbol 
dC/du means the spline C differentiated with respect to the original spline parameter u, analogous to the 
symbol d 2 C/du 2 . The other derivatives 


dU fU_ dS_ d 2 S 

ds ds 2 dt dt 2 


(4.4) 


arc difficult to compute because the functions U(s) and S(t ) can not be expressed analytically, but must 
be computed numerically. 

The following sections will describe how to reparameterize the spline C with the arc length s, how to 
reparameterize the arc length with time t, and how to compute the derivatives in (4.4). 




Figure 4.6: The arc length s as a function of the spline parameter u. 


4.2.2 Reparameterizing with Arc Length 

Let the trajectory of some point p = \x y z\ T be given by a space-spline C : M —> M 3 , and let the 
spline be parameterized by a global parameter u 

C(u) i— > p(u) = [x(u) y(u) z(u)] T (4.5) 

Flowever, the parameter u is not intuitive to use for humans. It is quite difficult for a human to predict 
exactly which point on the spline corresponds to a given value of the parameter u. Humans are much 
better at thinking in terms of the arc length s of the spline C. The arc length s of the spline C can also be 
explained as the distance traveled along the spline. The arc length s of C is given as the integral 

s{u) = [ du (4.6) 

Jo du 2 

where \\C{u)/du\\ n is the length of the derivative of C with respect to the parameter u. 

In order to reparameterize the spline C with the arc length s a function U : M —> M was introduced 
in Section 4.2.1. The function U(s) maps the arc length s to the corresponding spline parameter u. This 
means that the function U is really the inverse of the arc length function from (4.6). 

As we will show later, the arc length function (4.6) can not be computed analytically for a spline curve. 
Therefore, it is not possible to obtain an analytical expression for the inverse function U. The arc length 
(4.6) is a monotone function as shown in Figure 4.6, so the value u = U(s) can be computed numerically 
as follows: 

1. For some specified arc length s find the corresponding parameter value u such that s(u) = s. 

2. Given s compute u such that || s(u) — s|| < e using a root-finding algorithm and (4.6). 
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3. Having computed the value u, compute C(u). Using this procedure, points on the spline C can be 
computed using the arc length s as parameter. 

This corresponds to expressing the spline parameter u as a function of the arc length s 

u = U(s) (4.7) 

and use it to compute a point p on the spline p(s) = C(U (s)). 

This reparameterization makes it easier to predict how the scripted body moves along the spline, 
because the spline parameter used is the arc length, which is the traveled distance along the spline. 


4.2.2.1 The Arc Length Integrand 


Let C(u) be a cubic spline in 3-dimensional space 



x(ll) 


a x u 3 + b x u 2 + c x u + d x 

C{u) = 

y(u ) 

= 

a y u 3 + b y u 2 + c y u + d y 


z(u)_ 


a z u 3 + b z u 2 + c z u + d z 


Alternatively in matrix notation 


C{u) = 


&X 

°y 

CLz 


bx 

by 


Or 

Gy 

C z 


d x 

dy 

d z 



£ 

CO 


u 2 


u 

- 

1 


The arc length function (4.6) of the cubic curve then becomes 

pu| | dC{u) 


s(u) = 


i o 


dn 


du 


= / v Au A + Bu 3 + Cu 4 + Du + Edu 

Jo 


where 


A = 9(a^ + dy + a 2 z ) 

B = 12 (a x b x + a.yby + a z b z ) 

C = 6 (a x c x + ciyGy + a z c z ) + 4(6^ + by + 6?) 
D — 4 {b x c x + byCy + b z c z ) 

E = {cl + Cy + (%) 


(4.8) 


(4.9) 


(4.10a) 

(4.10b) 


(4.11a) 
(4.11b) 
(4.11c) 
(4. lid) 
(4. lie) 


Unfortunately, it is not possible to find an analytical expression for this integral, so it is necessary to do a 
numerical integration in order to find the value of s(u) for a given a value of u. The function 


dC{u ) 
du 


\J Au A + Bu 3 + Cu 2 + Du + E 


(4.12) 
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Figure 4.7: The traveled distance s plotted against time t as a number of discrete s*) pairs. 

is the arc length integrand. By applying Horner’s rule for factoring polynomials (4.12) can be rewritten to 
a more computationally friendly form 


= \J (((Ait + B)u + C)u + D)u + E. (4.13) 

With the theory developed so far, one can pick any numerical integration method, which fulfills the re¬ 
quirements to performance and accuracy and then apply it in order to compute the arc length s(u). 

4.2.3 Reparameterizing with Time 

To obtain an even better feeling of how the scripted body moves let S' : M —> M be the arc length 
expressed as a function of time t. 

An animator knows how a point p should move over time, so the function S can be specified by 
a number of time-distance pairs (tj, s, ) at discrete points in time as shown in Figure 4.7. The discrete 
(ti, Si) pairs might be interpolated by a spline V : M —> M 2 whose coordinates are functions of the spline 
parameter v as shown in Figure 4.8. 

V(v) = [t(v) s(n)] T (4.14) 

Now, the arc length function S(t) can be computed as a function of f as follows: 

1. Given a value t find the coiTesponding value v such that | t(v) — 1 1| < e, using a root-finding 

algorithm. Time t is a monotone increasing function, so a unique solution exists. 

2. Compute the second coordinate function s(v) of V. 


dC (u) 
du 
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Figure 4.8: The discrete ( ti,Si ) pairs arc interpolated by a spline V : R —> R 2 yielding a continuous 
function (t(v), s(v)) T which can be used to compute the traveled distance S(t) as a function of time t. 
The slope of the spline is equal to the velocity of the point traveling along the curve. 


This corresponds to expressing the spline parameter v as a function of t, and use it to compute the arc 
length s = s(v(t)) = S(t). This arc length can now be used to compute the parameter u for the spline 
C{u) as described previously. What has been obtained is the computation C{U(S{t))), which computes 
points on the trajectory of the scripted body using time t as parameter. 

An articulated figure might be animated using scripted motion as follows. Consider the end effector 
of an articulated figure as a scripted body, and let its trajectory be specified by a space-spline CJ(t') as 
explained previously. 

Then, by incrementing the time parameter t with small time steps At. the corresponding points on 
the space-spline C(t ) might be computed as described above. Finally, the actual joint parameters can be 
computed using inverse kinematics. This way, it is possible to animate the articulated figure as a scripted 
body. 


4.2.4 Computing the Derivatives 

From (4.3) and (4.4) it is seen that the following values need to be computed 


dC d 2 C 

dU 

d 2 U 

dS 

and 

d 2 S 

(4.15) 

du du 2 

ds 

ds 2 

dt 

dt 2 

The derivatives dC/du and d 2 C/du 2 arc 

easily 

computed 

from the analytic expression of the spline 

C(u). This leaves the remaining four derivatives 






dU 

d 2 U 

dS 


d 2 S 



ds 

ds 2 

dt 

and 

dt 2 


(4.16) 
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Analytic expressions for the functions S and U arc not available, but fortunately this is not the case for 
their derivatives, which can be computed analytically. 

Stalling with (4.6) the integrand can be rewritten as the dot product of the derivative of the space-spline 


s(u) = r 

Jo 


C( U ) 


du 


du = 


/ dC{u) 

dC(u) 

V du 

du 


du, 


(4.17) 


which is a more convenient form. Differentiating the arc length function s(u) with respect to the parameter 
u, is the same as differentiating the integral as a function of its limits. This yields 


ds(u) 

du 


/dC(u) 

dC{u) 

V du 

du 


(4.18) 


Recall, the function U(s) is the inverse of the arc length function s(u) given in (4.17). Therefore, the 
derivative of U ( s ) with respect to the arc length 

dU(s) 1 


ds 


ds 

du 


(' U(s)) 


; equal to 


1 


/dC(U(s )) 

dC(U(s)) 

V du 

du 


(4.19) 


From the above, it can be seen that although the function U (s) can only be computed numerically, it is 
possible to compute its derivative, because dC/du is given by an analytic expression, and U(s) can be 
computed numerically. Differentiating this derivative with respect to s yields 


d 2 U(s) d 


( 


\ 


ds 2 ds | 

v\ 

1 dC(U(s)) 

dC{U{s )) 


1 du 

du 

d I 

UC{U{s)) 

dC(U(s)) 

ds ' 

y du 

du 


2 ( dC(U(s)) dC(U(s )) \ 3/2 

\ du du ) 

(d 2 C{U{s )) dC{U(s)) dC(U{s)) d 2 C{U{s))\ dU(s) 


(4.20a) 


(4.20b) 


V du 2 


du 


du 


du 2 


ds 


f dC{U{s)) dC(U{s)) \ 
\ du du ) 


3/2 


(4.20c) 


After some rearranging of terms it is found that 


d?U{s) 

ds 2 


dC(U(s)) d 2 C(U(s)) 


du 

du 2 

(dC(U{s )) 

dC(U{ S )) y 

\ du 

du J 


(4.21) 
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The derivatives dC/du and d 2 C/du 2 arc given by analytic expressions and U(s) can be computed nu¬ 
merically, so the derivative d 2 U{s)/ds 2 can also be computed. 

Recall that the reparameterization of the arc length with the time S it ) was done using a spline V given 
by (4.14) and repeated here for convenience 


V{y) = [t(v) s(v) 


(4.22) 


Since the function V is a spline, so is each of its coordinate functions. This implies that analytic expres¬ 
sions for 

and tlM. (4.23) 

dv dv 

can be found by straight forward computations. In fact, these would be second-order polynomials when 
V is a cubic spline. 

Recall that given a value of t the corresponding value of the spline parameter v could be found nu¬ 
merically by a root search. This corresponds to expressing the spline parameter v as a function of time 
t. Denote this function v(t). With this notation the arc length can be expressed as a function of time 
S(t) = s(v(t)). The derivatives of S with respect to time is then given by 

dS(t) = ds(v(t)) = ds(v(t)) dv(t) 
dt dt dv dt 

Looking at the last term, it can be seen that this is in fact the inverse mapping of (4.23). Therefore, the 
above expression can be written 

dS(t) ds(y(t )) 1 

dt dv dt. ... ^ 

The value of v(t) is already known. It was found by the root search that was performed in the time 
reparameterization phase of the space-spline. 

The second derivative of the function S with respect to t can be computed as follows. From (4.25) it 
is found that 

d 2 S(t) d 2 s(v(t)) 

~nv~ = <4 - 26a) 


(4.25) 


d 2 S(t ) 
dt 2 


(4.26a) 


d 2 s(v(t )) dv{t) 1 
dv 2 dt dt(y(t )) 
dv 


ds(v(t )) 


dt(v(t)) 

dv 


d (dt(v(t)) 
2 dt \ dv 


d 2 s(v(t )) 
dv 2 

dt(v(t)) ^ 
dv j 
d 2 s(v(t)) 
dv 2 

dt(v(t)) ^ 
dv ) 


ds(v(t )) 

dv _ d 2 t(' 

^ dt(v(t)) ^ 2 dn 

ds(v(t)) d 2 t(v(t)) 

dv _ dv 2 

f dt(v(t))\ 3 


d 2 t.(v(t)) dv(t) 


(4.26b) 


(4.26c) 


2 


(4.26d) 
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The value v(t) is known so the above expression is an analytic expression evaluated at v(t). 
Looking at all the equations for the derivatives, it is realized that whenever 


dC{U(.s)) 

du 


= 0 , 


or 


dt(v(t )) 
dv 


= 0 , 


(4.27) 


might cause problems. However, it turns out these degenerate cases do not cause any problems in practice. 
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Simulation of multibody dynamics deals with simulating the motion of multiple rigid bodies, possi¬ 
bly connected to each other through joints. The various fields concerned with multibody dynamics have 
developed slightly different language usages. The computer graphics literature often uses the term rigid 
body simulation and the applied mathematical and engineering literature uses the term multibody dynam¬ 
ics. Therefore, it is appropriate to begin our discussion on multibody dynamics with a clarification of the 
terms we have decided to use in this book: multibody simply means multiple bodies. Multibody Dynamics 
is the physics of multiple bodies all in mutual contact with each other or possibly connected to each other 
by joints. By simulation of the multibody dynamics we get a Multibody Animation. 

The terms articulated figure, articulated body, or jointed mechanism arc often used for rigid bodies 
connected by joints. Sometimes the term rigid body simulation is used for those simulators that do not 
handle jointed mechanisms. Thus, multibody animation is the most general term covering every topic and 
scenario you can think of concerning rigid bodies. 

Particles and rigid bodies arc often the first concepts introduced in physics, and they arc regarded as 
the basic theoretical building blocks. The same view is also often applied in physics-based animation: 
rigid body simulation is considered the basic starting point for many practitioners. 

Many consider multibody dynamics to be simpler to simulate than chaotic and turbulent natural phe¬ 
nomena such as water, smoke, or fire. In our opinion this is a misconception, and although the free motion 
of a single rigid body floating in empty space is embaixassingly simple to simulate, rigid bodies arc ideal 
real-world models. However, in the real world everything deforms. For rigid bodies, this idealization 
results in difficult discontinuities, causing the numerics to become ill-conditioned and sometimes even 
unsol vable. 

The dynamics and mathematics of rigid bodies have been known since Newton. Nevertheless, nearly 
400 years later it is still a topic of active interest and scientific publication. Since the 80s there have been 
papers on rigid body simulation in every ACM SIGGRAPH Proceedings. The explanation is that even 
though the physics and mathematics is well established, it is not easily solved on a computer. Further¬ 
more, there is a continuous demand to simulate more and more rigid bodies faster and faster. Finally, 
in animation we often want to simulate unreal things that should appeal - plausible. Hence methods are 
needed that are incredibly fast, stable, robust, and tolerant for faulty configurations, none of which is the 
main focus in classical mechanics. 

The traditional approach for analyzing systems in classical mechanics often deals with systems in 
equilibrium. This is of little interest in animation, where we want to see objects in motion and colliding 
with each other. At equilibrium, the animation is usually over. In contrast, robotics has a long tradition of 
simulating mechanics; an equilibrium state is often the goal. Further problems are more concerned with 
kinematics of a single mechanism, controlling, planning, or computing the motion trajectory in a known 
and controlled environment. In animation, nothing is known about what should happen, and often only 
animation of several mechanisms is interesting. Besides, robotics tends to be aimed at only simulating the 
joints of a mechanism, not contact forces with the environment or other mechanisms. 

The quest for bigger and faster simulators seems endless and is mainly driven forward by the computer 
gaming industry and the movie industry. Rigid bodies are attractive in computer games, since they fit in 
nicely with the level of realism, and they are fairly easy to use to plan game events and build game 
levels. Basically, the industry knows how to use rigid bodies to create interesting game effects. The movie 
industry has moved beyond multibody animation to deformable objects, natural phenomena, humans, etc. 
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This part is a thorough introduction to multibody animation. The theory and methods we present arc 
highly biased toward our own work in the field, and readers should not expect a complete, in-depth, and 
detailed walk-through of every method and paradigm. 

Our contribution on multibody animation in this book begins with the simplest possible approach to 
model multibody dynamics: the so-called penalty-based methods. Penalty-based methods were among 
the first to appeal - and are still in use today. Then we will turn our attention toward the impulse-based 
methods, which are considered easy to implement compared to other methods. Impulse-based methods 
are characterized by the fact that they represent every interaction between bodies in the world through 
collisions; even a cup resting on a table is modeled by a series of highly-frequent collisions. Constraint- 
based methods are probably the kind of method that resembles a physicist’s view of the world. Constraint- 
based methods set up the motion of equations, and solve explicitly for the contact and constraint forces, 
considering that Newton’s second law can be integrated to calculate the complete motion of the objects. 

After we have introduced the reader to the three traditional “main-stream” methods for simulating 
rigid bodies, we will describe a formal, conceptual module design. The module design is used to explain 
many details of rigid body simulators, such as interaction with the collision detection engine or the time¬ 
stepping strategies. Finally we will present contact graphs, a convenient data structure to store information 
about contact points. 

Our aim with the multibody dynamics theory presented in this book is to equip the reader with the 
skills for building stable and robust multibody simulators capable of simulating large configurations with¬ 
out breaking down or giving up on unexpected errors or faulty starting conditions. We have focused on 
high performance, but it’s more from an algorithmic point of view, and not our most important concern. 
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Penalty-Based Multibody Animation 


Rigid bodies in the real world do not penetrate each other, and the penetration is impossible due to the 
contact forces between rigid bodies. In penalty-based multibody animation, a spring-damper system is 
used for penalizing penetrations. A spring-damper system behaves like a harmonic oscillator in classical 
mechanics, which will be reviewed here. For full detail see Section 22.4. 

Penalty methods have a lot in common with mass-spring systems, because all effects arc modeled by 
springs and dampers, see for instance [Provot, 1995], and they arc generally applicable to both deformable 
and rigid objects. Spring-damper models arc even found in biomechanical muscle models such as Hill’s 
muscle model and Zajac’s force model, see [Chen et al., 1992]. 

This chapter covers rigid bodies, but the theory is generally applicable. We strongly encourage readers 
new to the subject of classical mechanics to examine Section 22.1 for basic theory of the motion of rigid 
bodies. 


© 


5.1 The Basics 


The motion of a single rigid body is described in the Newton-Euler equations. These arc derived in full 
detail in Section 22.1. Here, we will briefly review the notation and equations. 

The center of mass is given by r, the orientation is given by the quaternion q and the linear and angular 
velocities arc given by v and u> 


r 


V 

Q 



V 


a 

u! 


a 


(5.1) 


where a and or arc the linear and angular acceleration, and can be found by 


F 


a = — 

m 


and a = I 1 (r + u x Ju>) . 


(5.2) 


Here m is the total mass and / is the inertia tensor. Furthermore, F is the total linear force acting on the 
center of mass and r is the torque w.r.t./ the center of mass. The ordinary differential equation (ODE) in 
(5.1) is called a Lagrangian formulation. Given initial conditions, one can use a numerical integrator to 
integrate the motion of the rigid body. See Chapter 23 for details about numerical integration. 

The Hamiltonian formulation of the motion of the rigid body is an alternative to the Lagrangian for¬ 
mulation and is given by 

d_ 

dt 


89 


r 


V 

Q 


\uq 

P 


F 

L 


T 


(5.3) 
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where P and L arc the linear and angular momentum respectively. In the Hamiltonian formulation v and 
u> is found by 

p 

v = — and uo = I~ 1 L (5.4) 

m 

As with the Lagrangian formulation the Hamiltonian formulation can also be numerically integrated. 

Regardless of the chosen formulation, one needs to find the values of the force F and torque r at a 
given instant in time in order to perform the numerical integration. These values would include contri¬ 
butions from external forces, such as gravity, and contact forces stemming from contact with other rigid 
bodies. 

In order to compute, the contact forces springs with rest-length zero arc inserted at all penetrations as 
shown in Figure 5.1. The springs will try to remove the penetrations, which is why they arc called penalty 
forces. Furthermore, larger penetrations means larger spring forces. That is, springs penalize penetrations, 
hence the name penalty method. The contact force from each point of contact is thus computed as a spring 
force. Call the external contributions / ext and r ex t and the i’th spring force / spring and T spr jn g! . Now the 
total force and torque arc computed as 


^ f ext ^ ^ f springy 

i 

(5.5) 

'T — T e xt + ^ ^ T springy 

(5.6) 
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In case of gravity being the only external force, we have 

/ext = m 9 (5.7) 

Text = 0 (5.8) 

where g = [0, —9.81, 0] 7 is the gravitational acceleration. 

In its most pure form the simulation loop of the penalty method can be summarized as 

• Detect contact points (run collision detection) 

• Compute and accumulate spring forces 

• Integrate equations of motion forward in time 

This is the pure form of the penalty method. In practice, it is combined with several other techniques, 
some of these arc surveyed in Section 5.9. 

Now we’ll turn our attention toward the actual computation of the penalty forces in a general simulator. 
Let the k'th contact point between the two bodies i and j with center of mass at r , and r 3 respectively be 

at position p k given in 3D world coordinates. The fc’th contact point has the contact normal, n k , also in 

3D world space, pointing from body i toward body j, and a measure of penetration depth, d k . Then the 
penalty force acting on body j is given by 

Fj = (- kd k - bu k ■ n k ) n k , (5.9) 

where u k denotes the relative contact velocity at the /;:‘th contact point, i.e., 

u k = (Vi + a>i x r ki ) - (vj + ujj x r kj ) , (5.10) 

where r kl = p k — r, and r kj = p k — r :j . For u k ■ n k < 0 objects are approaching each other, u k ■ n k = 0 
there is no relative movement, and if u k ■ n k > 0 objects arc moving away from each other. Notice how 
care fully the viscosity is modeled only to work in the direction of the contact normal. It was pointed out 
in [Baraff et al., 2003a], that damping should only work in the constraint direction. The force acting on 
body i is by Newton’s third law, 

E, E j. (5.11) 

Besides the linear force terms arising from the penalty force, there arc also some torque terms which must 
be taken into account. 


Tj = r kj x Fj (5.12a) 

Ti = r k i x F,- L . (5.12b) 

So far we have derived what can be termed the simple or classical penalty method. There arc. however, 
several difficulties associated with it. 
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A 


4 




Figure 5.2: Figure illustrating that it is not always easy to determine penetration depths and contact nor¬ 
mals based on local information only. 


• First of all it is not trivial to compute satisfactory contact normals or penetration depths. This is 
illustrated by Figure 5.2. Distance fields arc ideal for these computations, but they lack global 
knowledge about the entire state. 

The problem of determining meaningful contact normals and penetration depths is mostly caused 
by the use of local computations as shown in the figure. If a global computation [Kim et al., 2002] 
is used instead, these problems can be resolved. 


• Some contact points like the face-face case appealing in a box stack is difficult to handle by applying 
a penalty force at the contact point of deepest penetration [Hasegawa et al., 2003], as illustrated in 
Figure 5.3. To alleviate this problem, researchers have tried to sample the entire contact region with 
contact points as shown in Figure 5.4. There also exist methods, that integrate over the intersection 
area and/or volume. 


• Figure 5.2 illustrates another problem with contact normals, in the left column a “healthy” state is 
shown, the right column shows a slightly more displaced state. Observe that from the left to the 
right state, the contact normals make a discontinuous change; the problem is treated in detail by 
[Hirota, 2002, Hirota et al., 2001]. 





Figure 5.3: Figure illustrating a resting box, undergoing an unwanted and endless wiggling. 



Figure 5.4: Figure illustrating the idea of oversampling to alleviate the wiggling problem. 
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Figure 5.5: The one mass harmonic oscillator. 


5.2 The Harmonic Oscillator 

In Section 5.1 we showed how penetrations were penalized with spring-damper systems. A spring-damper 
system is equivalent to a damped harmonic oscillator. In order to study the behavior of the penalty method 
we should study the behavior of the harmonic oscillator. 

In this section we will briefly review the harmonic oscillator. The reader is referred to Section 22.4 
for more detailed treatment. 

5.2.1 The One Object Harmonic Oscillator 

Consider a particle attached to a wall of infinite mass with a spring as shown in Figure 5.5. The spring 
force given by 

-^spring — kx, (5.13) 

is called Hooke’s Spring Law, and k > 0 is called the spring coefficient. Newton’s second law of motion, 
dictates the motion of the particle as, 

mx = — kx, (5.14) 

which yields the second-order ordinary differential equation 

mx + kx = 0. (5.15) 

An analytical solution to this equation exists and can be shown to be 

x = B sin (ojq t) + C cos (cuq t ), (5.16) 
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Undamped Harmonic Oscillator 



Figure 5.6: An undamped harmonic oscillator 


where 



(5.17) 


V m 

is known as the natural frequency. The constants B and C can be evaluated from the initial position and 
velocity of the particle. The analytical solution (5.16) can be rewritten as a single cosine function, 

x = A cos (u>ot + <f>), (5.18) 


The particle follows a cosine motion with amplitude A, frequency wq, and phase <f>. The motion is conse¬ 
quently repetitive with a period T, 


T = 



(5.19a) 

(5.19b) 


Figure 5.6 shows an example of a harmonic oscillator. The idealized harmonic oscillator assumes that 
there is no loss in energy over time. This is physically unrealistic, and practically not as useful as a 
damped harmonic oscillator. A typical damping force is given by 


■^damping — bx, (5.20) 

which is commonly referred to as a linear viscosity force, and the coefficient b > 0 is called the damping 
or viscosity coefficient. From Newton’s second law of motion, the motion of the damped particle must 
obey, 

mx = —kx — bx, (5-21) 
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which is equivalent to the second-order ordinary differential equation 

rnx + bx + kx = 0 , 


or 


b k 

x -|- x H- x = 0 . 

m m 

Using the natural frequency lcq = ^ and introducing 7 = this may be written as, 


x + 'yx + u>qX = 0 . 


(5.22) 

(5.23) 


(5.24) 


An analytical solution exists and is simply stated here as, 

x = A exp cos + (j >), 

where A and 0 arc constants determined from initial conditions, and 

1x1 = 

is the frequency. The frequency uq has real solutions when 

ul - y > 0 
k b 2 

=> -1—9 ^ 0 

m 4 m- 

=> 4km — b 2 > 0 



(5.25) 

(5.26) 

(5.27a) 

(5.27b) 

(5.27c) 


Comparing the damped harmonic oscillator (5.25) with the undamped harmonic oscillator (5.18) it is seen 
that the amplitude of the damped oscillator is exponentially decaying and that the frequency lo\ of the 
damped oscillator is less than the frequency, ccq, of the undamped oscillator. 

The zero-crossings of the damped harmonic oscillator naturally occurs at equal time intervals by the 
period 

T = —. (5.28) 

OJl 


Surprisingly, the peaks of the motion do not lie halfway between the zero-crossings 
ure 5.7. The damped motion may be described qualitatively by examining the value 
ratio of 

7_ 

u)i ' 


as illustrated in Fig- 
of b 2 — 4ink and the 

(5.29) 


The motion is: 


Overdamped, when b 2 — 4mk > 0. Two cases of over-damping can be distinguished: 
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Lightly Damped Harmonic Oscillator 



Figure 5.7: A lightly damped harmonic oscillator 


Lightly damped, if ^7- -C l then A(t) decays very little during the time the cosine makes many 
zero-crossings. 

Heavy damped, if is large then A(t) rapidly goes to zero while the cosine makes only a few 
oscillations. 

Critical damped, when b 2 — 4 mk = 0. In this case the amplitude tends most rapidly to zero (without 
crossing zero) and flattens out. 

Underdamped, when b 2 — 4 mk < 0. In this case the amplitude decays even faster than in the case of 
critical damping, but the amplitude will cross zero and increase again before flattering out. 

Examples of the four categories are illustrated in Figure 5.8. Using a harmonic oscillator for a mass¬ 
spring system to model penalty forces, reasonable motion is obtained for heavily damped or at best critical 
damped, since oscillating forces such as oscillating contact forces arc often undesirable. Furthermore, it is 
often desirable to require that the amplitude should decay to zero within a time-step, since this will make 
objects appeal - rigid. 


5.2.2 The Two Object Harmonic Oscillator 

Consider a two-body system, as shown in Figure 5.9, where two particles are connected with a spring. 
Hooke’s law for this system is given as the following two equations, 

rriiXi = k (xj — xy), (5.30a) 

rrijXj = —k (xj — Xi) . (5.30b) 
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Lightly Damped Harmonic Oscillator Heavily Damped Harmonic Oscillator 





Figure 5.8: A qualitative analysis of a damped harmonic oscillator. 



Figure 5.9: The two-mass harmonic oscillator. 
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Dividing the first equation by m,; and the second by rtij, yields 

(5.31a) 
(5.31b) 

and subtracting the first from the second, yields 


Xi = k— ( Xj - Xi ), 
mi 

Xj = —k — (xj — Xj ), 

m.j 


nr* . _ nr* . - - In _____ I nr* . _ nr* . \ _ Is* _____ I nr* . _ nr* . | 

*Xj *Xj 2 J Dj J ) 


m 


•3 


mj 


= [ — + — ) ( X J ~ X i) ■ 


mj mj 


(5.32a) 

(5.32b) 


The two-object harmonic oscillator has the same motion as the one-object harmonic oscillator. This may 
be seen by setting, x = Xj — Xj, in (5.32b), such that 


x = —k 


1 1 

m,j mi 


x 


Introducing the reduced mass as, 

mimj 

P = --- 

mi + mj 


it is discovered that 


fj,x = —kx. 

We conclude that the two-object harmonic oscillator behaves as a one-object system of mass ji. 


(5.33) 


(5.34) 

(5.35) 


5.3 Picking Parameter Values 

Using a harmonic oscillator to model penalty forces, reasonable motion is obtained in case of a heavily 
damped or critical damped harmonic oscillator, since oscillating forces such as oscillating contact forces 
are undesirable. Furthermore, it is desirable to require that the amplitude should decay to zero within a 
single frame computation, since this will make objects appeal - rigid and ensure that an observer will not 
notice any oscillating behavior. 

Thus we need a method for picking the values of the spring and damping coefficients, so that we can 
control the behavior of the harmonic oscillator to achieve the desired motion. That is, we want to pick 
parameter values such that we are not able to see the springiness of the corresponding mass-spring system. 

The oscillation cycle of a harmonic oscillator, i.e., the time period (5.28), is given by 
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To avoid temporal aliasing the time-step At between two frames must be smaller than \T according to 
Nyquist sampling theory [McClellan et al., 1998]. That is 

At < (5.37) 

This knowledge can, for instance, be used to determine a suitable spring coefficient k, that will allow us 
to use the maximum frame time-step without temporal aliasing. That is 


At = 


T 

2 


7r 
U)l 



(5.38) 


Rearranging, we find that 

At \f^ 

Squaring both sides and isolating uq 2 we get, 



At 2 



= 7T 2 . 


(5.39) 


(5.40) 


Using Wq = A and 7 = — we find that 


It is thus concluded that 


k_ 

m 

k 


At 2 4m 2 

m.TT 2 b 2 
At 2 4 m 


(5.41) 

(5.42) 


This approach is used in [Hasegawa et al., 2003], but it does not provide us with any means for picking 
the coefficient of damping b, i.e., it is impossible to control the rate of the decay. 

In order to get a method that also allows control of the coefficient of damping, we will start by deriving 
the conditions for a highly damped oscillator. For such an oscillator we must have that the ratio in (5.29) 
goes to infinity, 

— = c —> 00. (5.43) 

u>i 

Substituting 7 = A and (5.26) we derive 


b_ 

m 



(5.44) 


Squaring yields 


b 


2 


b 2 
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Substituting (5.17) and collecting terms gives 






Taking the limit for c going to infinity, gives 


k = 


4 m 


(5.46a) 

(5.46b) 


(5.47) 


Observe that this is in fact a critically damped oscillator , because we have b 2 — Arrik = 0, so we have 
shown that the limiting case of the heavily damped oscillator is in fact the critical damped oscillator, as 
expected. 

We’ll investigate the critical damped harmonic oscillator further by looking at the frequency of it 


UJl 




- \J4mk — b 2 

2m 


0 . 


(5.48a) 

(5.48b) 

(5.48c) 

(5.48d) 


This means that the frequency vanishes, and there is no oscillation. Observe that this indicates an infinite 
long period, since 

2 vr 

lim T = lim — = oo. (5.49) 

uj\ —>0 oj\ —>0 0J\ 

Using these facts about the critical damped harmonic oscillator, we see that the equation describing the 
motion of the harmonic oscillator (5.25) reduces to a simple exponential decay: 

x = A exp (©cos (u\t + 4 >). (5.50) 

In our case u q = 0, and the phase shift is zero since at time t = 0 the spring will have maximum amplitude 
equal to the penetration depth between two rigid bodies. Thus we conclude that 

x = A exp . (5.51) 

Exponential decay is often characterized by the time r required for the amplitude of the signal to drop to 
e _1 ~ 0.37 of its initial value. That is 

exp (-1) = exp (5.52) 
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This time is called the characteristic time or the damping time [Kleppner et al., 1978]. For our purpose r 
becomes a parameter for controlling the rate by which the signal tends to zero. For the moment we will 
let the value of r be a user-specified parameter. By the definition of r in (5.52) we derive 

-1 = ~\t, (5.53a) 

=#> 7 = (5.53b) 

T 

which yields a formula for selecting the value of the viscosity coefficient 


b = 


2 m 

T 


(5.54) 


Knowing that we want critical damping, indicating that b 2 — 4mA; = 0, we can derive a formula for 
selecting the value of the spring coefficient 


7Ti 

k = ^. (5.55) 

Although our derivations differ from the ones given in [ Baize I et al., 1988], the approaches are identical. 
The problem that remains is a method for picking the value of r. We will derive such a method in the 
following. 

We want the amplitude to drop to a fraction 0 < e < 1 within a single frame computation. Thus we 
need to determine how many steps n of r arc needed before the amplitude decays to a fraction below e, 


exp (-7T nr ) = ( e_1 ) n ^ e > (5-56) 

from this we derive 

(e -1 ) n < e (5.57a) 

nln(e - 1 )<ln£ (5.57b) 

n>— lne (5.57c) 


In conclusion, we see that we must pick n 
can compute r as 


[— In c] . Given the time-step At between two frames we 



(5.58) 


This completes the derivation for our method of picking parameter values. Pseudocode is listed in Fig¬ 
ure 5.10. In Figure 5.11 is shown a piece of Matlab code, illustrating how parameters could be picked, 
such that the amplitude drops to below one percent of its initial value within a single simulation time-step. 
In Figure 5.12 is shown the output from the Matlab code from Figure 5.11. Note the values of the coeffi- 
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Algorithm pick-parameter-values ( A t,e,m ) 
n = \— In e] 
t = At/n 
b = 2m /t 
k = m/r 2 
End algorithm 

Figure 5.10: Pseudocode for picking parameter values of a spring. The time-step between two frames is 
given by At, the decay fraction of the amplitude is given by e and the mass is given by m. In general m 
would be the reduced mass (see Section 5.2.2). 


dt = 0.01; % Time-step size 

tau = dt/6; % Damping time 

A = 0.02; % Initial Amplitude 

m = 10; % Mass 

gamma = 2/tau; 

delta = l/tau A 2; 

b = 2*m/tau; % Damping Coefficient 

k = m/tau A 2; % Spring Coefficient 

omega_0 = sqrt(delta); % Natrual Frequency 

omega_l = sqrt(omega_0 A 2 - gamma A 2/4); % Frequency 
t = 0:tau/100:6*tau; 

x = A * exp(-(gamma * t)/2).*cos(omega_l*t); % Signal of harmonic oscillator 

plot(t,x) ; 

title('Critical Damped Harmonic Oscillator'); 

xlabel('time (seconds)'); 

ylabel('Penetration depth (cm)'); 

text(tau,0.5*A,textlabel( 'A *exp(-gamma*t)*cos(omega_l*t)') ); 
text(tau,0.4*A,['k = ' mat2str(k)] ); 

text(tau,0.3*A,['b = ' mat2str(b)] ); 


Figure 5.11: Matlab code for picking spring- and damper-parameter values of a critical damped oscillator 
with a controlled decay. 



© 


“book” — 2005/9/30 — 15:44 — page 104 — #116 


£■ 


© 


© 


104 


CHAPTER 5. PENALTY-BASED MULTIBODY ANIMATION 



time (seconds) 


Figure 5.12: Plot showing the amplitude of the critical damped harmonic oscillator from Figure 5.11. 
Notice that the amplitude, i.e., the penetration depth, drops to almost zero within a single time-step At = 
0.01 seconds. 


cients, k and b. The half-life time is an alternative time constant, which is defined as the value r for which 
the signal has dropped to one half of its initial value, that is 


exp 




(5.59) 


If preferred, one could use the half-life time instead of the damping time, to derive formulas for picking 
the values of k and b. 


5.4 Solving Harmonic Oscillators Numerically 

In the previous section we worked directly with the analytical solution to the damped harmonic oscillator. 
This is seldom possible in a simulator, which must rely on numerical methods. This section is devoted to 
the task of applying a numerical method to a spring-damper system. Two methods arc essentially studied: 
the explicit Euler method and the implicit Euler method. The interested reader is referred to Chapter 23.1 
for more details. 

According to (5.51) we have x{t) = A exp (—^/). Differentiating w.r.t. time yields 

x(t) = ~x(t), (5.60) 

which is the ordinary differential equation we will tty to solve numerically. 

For the explicit first-order Euler scheme , the numerical approximation, Xj+i, at time ti + 1 = Z j + h to 


© 


© 


© 


© 




“book” — 2005/9/30 — 15:44 — page 105 — #117 



5.4 PHYSICS-BASED ANIMATION 


105 



(5.61a) 

(5.61b) 

(5.62) 


Since the exact solution is x(t) = A exp ( —|/j then the absolute error is given by 

| x(ti) — Xi\ = v4exp TjihJ — A ^1 — h^j (5.63a) 

= \A\ exp - (i - h^j , (5.63b) 

from which we see that the accuracy of the numerical solution is determined by how well 1 — h\ approx¬ 
imates exp (— 7 / 1 ). Furthermore, the numerical solution will only converge to the analytical solution if 

l-h,l\<l, (5.64) 

see Chapter 23.1 for more details. This effectively restricts the time-step h for the numerical method: 
since h. 7 > 0 then there are two cases to consider. First, if hj < 1, then numerical convergence may 
only occur, when 1 — < 1 , hence 

0 < hy < 2. (5.65) 

Second, if /17 > 1, then numerical convergence may only occur, when 1 — > — 1, hence 

2 < h'y < 4. (5.66) 

Notice, if in the first case h >! then we arc in second case. Thus from 11 — /; 41 < 1 and h, 7 > 0 we 
have 

h< - = ^, (5.67) 

7 b 

where // is the reduced mass of a two object harmonic oscillator as described in Section 5.2.2. From (5.67) 
two things become clear: 

• The greater mass ratios there arc present in the system, i.e., rn l <C m ? , the smaller time-step is 
required. 


• More damping requires less time-step size as well. 
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An alternative to the explicit Euler is the implicit first-order Euler scheme in more depth. It is defined 
as 

x i+ i = Xi + hx i+ i. (5.68) 

Since we know the analytical solution to the problem x(t ) = .4 exp( — %t), this is easily evaluated to 


•t'i+l — •t'i h 2 Xi+1 . 


A litde mathematical manipulation yields 

(l + hx i+ i = Xi, 


1 — 


1 + hfi 


(5.69) 


(5.70a) 

(5.70b) 


Remembering that the analytical solution has x(t) —> 0 for t —> oc, but in contrast to the explicit method, 
the implicit method converges even as h oc: 


= °- (5 - 71) 

This effectively means that the stability requirements do not require a bound on the time-step of the 
implicit method. We say that the implicit method is unconditionally stable. 

Comparing the explicit with the implicit methods for a spring-damper system, the implicit method is 
obviously superior, since it will always converge to the correct solution regardless of the chosen time-step. 
Figure 5.13 compares the implicit and explicit method for different values of the time-step. However, 
while the implicit method always converges to the correct solution, it does so at a slower rate than the 
exact solution. The explicit method gets into more and more trouble the closer h gets to ^; when h passes 
this value the numerics simply explode. Furthermore for those time-step sizes where the explicit method 
is stable, it will converge to zero even faster than the analytical solution. 

5.5 Using the Penalty Method in Practice 

The previous section appeal's to supply bulletproof values for the coefficients of the spring-damper sys¬ 
tem, nevertheless practitioners report that massive amounts of parameter tuning is needed for mass-spring 
systems. Surprisingly enough, they are still widely used and very popular—soft bodies in tools such as 
Maya or 3DMax clearly illustrate this. We believe this popularity is mainly due to the simplicity of mass¬ 
spring systems; they seem easy to understand and implement. However, there is also a computational 
justification for using penalty methods. If a configuration has k contact points, the collision response 
only takes 0(k) time and has very low time constants. In comparison, a constraint-based simulation to 
be discussed in Chapter 7 has at least 0(k 3 ) expected average time for an exact solution, and with quite 
high time-constants and some constraint-based methods, only guarantees expected polynomial time, that 
is 0 () for some n» 1. Although recent research in iterative numerical methods for constraint-based 
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methods yields much better results [Erleben, 2005], the penalty-based approach is still a computationally 
cheap method. Impulse-based simulation , to be discussed in Section 6, is in some cases comparable to 
the O(k) running time of the penalty-based method. However, in some cases it is possible to design 
configurations that have O(oo) running time with an impulse-based simulator. With some tricks of the 
trade it is possible to enforce a 0(ck) running time for an impulse-based simulator with some constant 
c > 1. Other simulation methods solve QP optimization problems. For instance, optimization-based an¬ 
imation [Schmidl, 2002], which is an NP-hard problem. So in short, there is a performance benefit from 
using penalty-based methods, because even though the other methods have worse time complexities, they 
can be numerically more pleasant, allowing them to take larger time-steps. Penalty methods arc known 
to produce stiff, ordinary differential equations, which requires small time-steps to satisfactory solve the 
equation numerically, whereas, for instance, a velocity-based complementarity formulation, as described 
in Section 7, can take extremely large time-steps in order of thousands times that of the penalty method, 
without ever becoming unstable. 

It is not just the problem of stiff ordinary differential equations that can make penalty methods in¬ 
tractable, they can also suffer from a secondary oscillations. These arc resonance effects, which occur in 
forced and possibly damped harmonic oscillators, i.e., where some external time dependent force. Fit) is 
affecting the system, 

x + yx + UqX = F (t). (5.72) 

In classical mechanics the behavior of this system is often studied with 

F (t) = Fq cos (cot). (5.73) 

Here T 7 ], is some constant, and u is the driving frequency. Fit) is called the driving force. Secondary 
oscillation is seen when c o is close to the frequency of the oscillator i.e., ujq for undamped and uj\ for 
damped. The effect is seen as an increase in amplitude. For the undamped oscillator the amplitude goes 
to infinity as uj —> ujq, physically this is not a problem, since no real-world systems arc undamped, but in 
a simulator this might occur, since we arc working on an ideal world. In the damped case, the amplitude 
cannot go to infinity, but is limited by 7, see [Kleppner et al., 1978, pp. 427]. In other words, secondary 
oscillation can amplify the penalty force in a simulator, such that the amplification can kick objects away 
from surfaces. One way to combat this is to make 7 as large as possible, and one way to detect such a 
phenomena is to track the total energy of the system—if it suddenly increases, this might be hint of such 
problems. 

The observant reader might at this point claim that it is very unlikely that there is a driving force in 
form of (5.25), and in most cases, we will often only have to deal with three kinds of driving forces: 

Imprecision: In cases of imprecision, e, the equation of motion becomes 

x + yx + ujqX = e. (5.74) 

We have no way to predict what kind of function e is, we only know it is very small valued, and 
unlikely to be periodic. 

Gravity: Gravity is another force acting on the system, so the equations of motion become 

x + yx + lcqX = e + mg. (5.75) 
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This is a constant force and should not prove to be a problem. 

Contact: There is also the contact forces from all other objects, such as in a box stack. This is equivalent 
to a serial connection of springs as can be seen in Figure 5.14. Looking at the equation of motion 
for a single spring in such a system gives 

x + y/x + ujqX = e + mg + F contact . (5.76) 

Since all other contact forces arc also modeled by spring-damper systems, these forces will act 
as a damped driving force. There might be more than one object in contact, which means that 
the driving force might contain many different frequencies. Multiple contacts might even cause 
a damped driving frequency close to the oscillators frequency, in which case an effect si mi lar to 
secondary oscillation can be expected to occur. 

Looking at all the spring-damper systems as a whole system, the energy is shifted from one spring- 
damper system to another, and the damping will dissipate energy over time and the system will 
eventually settle in equilibrium. Thus, from this viewpoint, there is no secondary oscillation effect, 
instead a wobbly elastic effect may be seen. 

In conclusion, we do not know if numerical issues might cause secondary oscillation, nor whether multiple 
contact formations arc likely to cause a local effect for each spring si mi lar to secondary oscillation. It is 
therefore likely to expect problems similar to secondary oscillation the more complex the configuration 
gets. These secondary oscillations arc another source for parameter tuning, generally speaking, increasing 
the value of 7 should help alleviate the problem. Increasing 7 can only be done by increasing b, which 
means that the system is likely to become over-damped instead of critical damped. 

5.6 Secondary Oscillations 

A secondary oscillation is a resonance phenomena. To gain more insight into the significance of the 
secondary oscillation in the penalty method, we will study the case of a idealized box stacking as shown 
in Figure 5.14. We have three boxes denoted by the numbers 1, 2, and 3. Each box is described by three 
positions, x\, X 2 , and x:>. Horizontal movement is ignored. The contact forces between the boxes arc 
modeled by a critical damped spring system, that is two springs, a and b, arc inserted between the boxes 
as shown in the figure. The springs and damping coefficients arc determined as outlined in Section 5.3. 
The boxes arc given a size of 1 meter, and initially they arc displaced such that there is 1 cm empty space 
between them. The bottommost box is fixed, meaning it has infinite mass and therefore is immovable. 
Damped springs arc inserted between two neighboring boxes whenever they penetrate each other. As 
soon as neighboring boxes arc touching or separated, the springs arc removed. 

Figure 5.15 shows aMatlab implementation of a box stack simulation. In Figure 5.16(a) the simulation 
result of the stack of 3 boxes is shown. The simulation is done perfectly, in Figure 5.16(a) no oscillation 
can be seen, and as verified in Figure 5.16(b) the deviation from equilibrium position is small and boxes 
quickly come to rest. Notice however, that when the upper box comes in contact with the box beneath 
it, then it is thrown upward. This indicates a small energy gain. The damped secondary oscillation that 
was previously discussed is not seen in Figure 5.16. However, let us increase the complexity of the 
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Figure 5.14: Idealized box-stack example, used for analyzing secondary oscillations. Notice that penalty 
springs arc a serial connection of a mass-spring system. 


configuration by increasing the stack height. Simulation results arc shown in Figures 5.17 and 5.18 for 
box stacks of 5, 10, 20, and 40 boxes. For a stack of 5 boxes we see nearly the same results as we 
did for a stack of 3 boxes, however some small jittering is slowly becoming noticeable. It takes roughly 
0.2 seconds before the jittering vanishes, which corresponds to 20 frames of the animation. The jittering 
becomes worse and longer as the box stack is increased. For 20 and 40 boxes it has become so clear 
that it will catch the eye of an observer. If we look at the deviation from the equilibrium positions, the 
damped secondary oscillation is clearly seen as we increase the height of the box stack, stalling to be 
noticeable around a stack of 10 boxes. Notice also that the oscillations arc orders of magnitude larger than 
the simulation time. For stack heights of 20 and 40 another effect of the penalty method becomes evident: 
when the system reaches equilibrium there arc still errors in the simulation. Notice that for a box of 40 
objects the topmost box deviates from its expected equilibrium position by 5 cm, i.e., 5% error. Of course 
this an accumulated error, due to the errors in the positions of the 39 boxes below the topmost box. The 
deviations for the stack height of 40 boxes also shows that within the first second of simulation, we will 
see a single oscillation of the topmost box with an amplitude of 10 cm. This will definitely be noticeable 
by an observer. 

Another interesting effect of a penalty-based method is seen if the initial conditions of the box stack 
is changed slightly, such that the boxes arc initially penetrating by 1 cm instead of being separated. Fig¬ 
ure 5.19 shows the simulation results of a box stack with seven initial penetrating boxes. Notice how 
objects arc being thrown up in the air. The initial penetrations make the springs work like a spring gun on 
the boxes. Even a small penetration as the one in the example can make the topmost box fly up in the air 
at a height of the same order as the size of the box itself. 

In our simulations shown in Figures 5.16, 5.17, 5.18, and 5.19, the mass properties of the boxes were 
chosen in a favorable way, the masses were all equal and small in magnitude. If a box stack has large 
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function x_plot = boxstack(N) 
x_equilibrium = 0:1.:N; 
x = 1.01*x_goal; 
v = zeros(size(x)); 
m = ones(size(x)); 
m(l) = realmax; 

mu = (m(1:end-1).*m(2:end))./(m( 

timestep = 0.01; 

tau = timestep/6; 

kc = mu/(tau*tau); 

be = 2*mu/tau; 

h = min( mu./bc); 

i = 0; 

F = zeros(size(x)); 

x_plot = x'; 

while ( i<maxsteps ) 

dx = x(2:end) - x(l:end-l) 
dv = v(2:end) - v(l:end-l); 

S = zeros(size(dx)); 
if max(dx<0) 

S(dx<0) = S(dx<0) - kc(dx<0) 


%- Equilibrium of N boxes width of 1 m. 

%- Setup initial positions, 1 cm space. 

%- initial velocity of boxes 

%-masses of boxes 

%- First box is fixed 

1:end-1)+m(2:end)); %- Reduced masses 

%- Simulation time-step 

%- Damping time 

%- Spring coefficients 

%- Viscosity coefficients 

%- Integration time-step (<4 mu/b) 

%- Iteration counter 

%- Total external force 

%- Results 


l; 


. *dx(dx<0) - be(dx<0).*dv(dx<0); 


end 

F(2:end) = -m(2:end)*9.81; %- Gravity 

F(2:end) = F(2:end) + S(l:end); 

F(2:end-1) = F(2:end-1) - S(2:end); 

F(1) = 0; %- First box is fixed 


x = x + h*v; 
v = v + (h * (F./m) ) ; 
i = i +1; 

if ( mod(i,20)==0 ) 
x_plot = [x_plot,x']; 
end 
end 


Figure 5.15: Matlab code for resonance phenomena analysis of box stack. 
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time (seconds) 


Deviation from equilibrium positions 



(a) 


(b) 


Figure 5.16: Simulation result for stack of 3 boxes. 


ratios in masses, the same effects as we have shown can appeal - for even small box stacks of height 2 or 
3. Besides the large mass ratios will make the ordinary differential equations describing the box motions 
become even stiffen 

5.7 Inverse Dynamics Approach 

In this section we will embark upon a quite different approach originally presented in [Barzel et al., 1988]. 
The approach taken here does not suffer from the secondary oscillation problems described earlier. The 
general idea is to set up the desired motion, by using the critical damped spring system, and then solve for 
the penalty forces required for this desired motion to occur, this is called inverse dynamics. After having 
solved for the penalty forces, the total force acting on each object can be computed and the equations of 
motion can be integrated forward in time (forward dynamics). We devote this section to outline the system 
of equations that needs to be solved for the inverse dynamics problem. 

Consider a total of n rigid bodies and K contacts, and assign a unique number k to each contact. Then 
assign a unique index for each object in our scene, and let i k and j k denote the indices of the two incident 
bodies in contact k. We use the convention that i k < j k . We also have a contact normal n k and a contact 
point p ki with respect to body i k (similar for body j k ). Both contact normal and points are given in the 
world coordinate system, and with the convention that the contact normal points from the body with the 
smallest index to the body with the largest index. This is illustrated in Figure 5.20. Note that we can never 
have i k = j k . For each contact we can compute a vector from the center of mass, r „ of an incident body 
with index i, to the point of contact p ki , that is 

r ki = Pki ~ r i (5-77) 

From classical mechanics we have the Newton-Euler equations (see Section 22.1 for details) describing 
the motion for all bodies. For the Fth body, the mass of body i is given by rn, and the inertia tensor by I t , 
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Figure 5.17: Simulation results for box stacks of (a) 5 and (b) 10 boxes. 
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(a) (b) 


Figure 5.19: Simulation results for stack of seven boxes with initial penetration of 1 cm. 



Figure 5.20: Notation for the /c’th contact point. 
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the position is given by r t and the velocity of the center of mass as the orientation is represented by the 
quaternion q % and the angular velocity by uj 1 . The acceleration of the center of mass is denoted by a 8 and 
the angular acceleration by or,. The Newton-Euler equations for the i’tli body look like this (summations 
arc taken over all contact points): 


r t = Vi (5.78a) 

Qi = (5.78b) 

Vi = m- 1 fk ~ m * _1 fk + m i ' fT l (5.78c) 

jk = i ik = i 

Wi = Z" 1 r k? x fk-ir 1 ^ Tki x fk (5.78d) 

jk = i i'k =, i 

- x IiUi + /-Vf (5.78e) 


The dot notation means the time derivative 4, and is used to ease readability. Observe that f k denotes the 
contact force at the k ’th contact. The effect of all external forces on the center of mass is given by /)' xl 
and the total torque from external forces arc given by r? xt . 

We will start with defining two position vectors, 


r h 

+ 

rij 

r h 

+ 

r 2j 

r 3 K 

+ 

r Kj_ 

r h 

+ 

ru 

Vi2 

+ 

r 2 i 

r iK 

+ 

V Ki 


The concatenated separation vector now simply reads 


(5.79) 


(5.80) 


d = Pj ~ Pi- 


(5.81) 


In practice, the separation vector is more easily computed by concatenating the contact normals multiplied 
by the corresponding penetration depths. Introducing the “contact” matrix C G M 6nx3K defined as, 


C hk — 


— 1 for h = 2i k 

-r y H for h = 2 i k 

< 1 for h = 2 j k 

r kj for h = 2 j k 

0 otherwise 


1 

1 


(5.82) 
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where a x is the cross product matrix (see Proposition 18.4) and the generalized velocity vector u G M 6n 
as 

u = [vi,u;i,V2,u>2,--- ,Vn,Vn] T ■ (5.83) 

Then the derivative of the separation vector is simply written as 

d = C T u. (5.84) 


By the product rule we find the second derivative to be 


d = C T u + C T u, 


where 


C hk =< 


-K) X ^ X , for h = 2ik 


(“j k Vr x kj 

0 


Now using the generalized mass matrix M £ 

mil 

M = 


6 nx6n 


for h = 2 j k 
otherwise 


m„l 


and the external forces, torques, and velocity dependent forces, / ext £ 


/ext = [/l X \ T e i Xt - Wi X JiWi, • • • , T® Xt - CO n X I n U> r 


(5.85) 


(5.86) 


(5.87) 


(5.88) 


The Newton-Euler equation reads, 

u = M- 1 (C/ penalty + / ext ) . (5.89) 

Here / alty £ M 3A is the concatenation of all the contact forces f h , using this in our derivative formula 
results in 

d = C T u + C t M~ 1 (C/ penalty + / ext ) , (5.90) 

which is rewritten into, 

d = C T u + + C T M _1 f exV (5.91) 

Requiring all contacts to be critical damped results in the simultaneous vector equation, 

d+-d+-^d = 0. (5.92) 

r t- 
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Substitution of our past results yields the matrix equation, 

( C T u + C T ]M^ C f penalty + C T M~ l f^ t ) + ^ C T u + ^d = 0. 
ReaiTanging a little yields, 

gjVT^C / penalty + C T u + C T M- l f sx t + + ^d = 0 


penalty ^ 


(5.93) 


(5.94) 

(5.95) 


This is a linear system, which can be solved for the penalty forces. Observe that A might not be invertible; 
multiple contact points will result in linearly dependent rows. Multiple solutions can exist; a method like 
singular value decomposition (SVD) can therefore be applied to solve for the penalty forces. Given the 
generalized position and orientation vector, s G M 7n : 


s = [ri,qi,r 2 ,q2,'-- ,r n ,qn} T , 

and the matrix S G M 7nx6n , 

1 0 ' 

Q i 

5 = 

l 

0 Q n 

where q* = [sj, Xi,yi, Zi] 7 G M 4 and Q i G M 4x3 : 

-Xi -yi -Zi 
Si Zi -Vi 
Z"i Si Xi 

Vi 



(5.96) 


(5.97) 


(5.98) 


The matrix Q i is derived from the relation \oJiqi = as shown in Proposition 18.47. The Newton- 

Euler equations, see Section 22.1, can now be written as 

s = Su (5.99) 

u = M- 1 (C/ penalty + /ext) . (5.100) 

Now all that remains is to solve these ordinary differential equations with a suitable numerical integration 
method, see Section 23.1. 

If we restrict the penalty forces to work only in the normal direction, that is 

fh = n h f h , (5.101) 
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and by introducing the matrix of contact normals N e R 3K 


xK 


N = 


n i 


n 2 


0 tik 

this is simply written for all contact points as the matrix equation 


f penalty ^ 


h 

h 

fl< 


= N f. 


Now (5.81) changes slightly, and instead we will have, 

d = N T (;pj - 


(5.102) 


(5.103) 


(5.104) 


The vector d is now simply a vector of penetration depths measured along the contact normals. Perform¬ 
ing the double differentiation again, this time remembering that the newly added term N is also time 
dependent, yields 


d = N T ( Pj - p^ + N t C t u , (5.105a) 

d = N X ( Pj - p^ + N T C t u + N T C t u + N t C T u + N T C T ii (5.105b) 

= N X ( Pj - p^ + 2 N T C t u + N t C T u + N t C t u. (5.105c) 

Using d + ^d + = 0 and ii = AT” 1 (CN f + / ext ) together with some mathematical manipulation 

yields the linear system, 

Af + b = 0, 

where 

A = N T C T M~ l CN , 
and 

b = Ar T C T M” 1 / ext + N 1 ( Pj - Pi) + 2 N T C t u + N t C T u 
+ \ (Pj - Pi) + N T C T uj + ^ {N t ( Pj - p^) . 

The only difficult things to handle in (5.108) are the terms N and N. However, since their elements are 
vectors each describing only a direction, we can compute these matrices by taking the cross product of the 


(5.106) 

(5.107) 

(5.108) 
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individual elements with the respective angular velocities. That is, 

["^n X rii 


N = 


u h x n 2 


0 


(5.109) 


[0 uj iK x n K J 

where ui lk denotes the angular velocity of body i at the k ’th contact point. By differentiation w.r.t. time 
we have 


a h x Tii T Wu x x ni) 


N = 


0 


(5.110) 


[ 0 a iK x n K + Ui K x ( u iK x n K )\ 

where ot lk denotes the angular acceleration of body i at the k ’th contact point. Having solved for /, we 
can write the equations of motion as 


ii = M- 1 (CNf + / ext ), (5.111) 

and the forward dynamics problem can be solved by integration of (5.111). 

5.8 Modeling Friction 

Previously we have only been concerned with normal forces, that is forces that should prevent penetration 
of the rigid bodies, however another important contact force is friction. Coulomb’s friction law as de¬ 
scribed in Section 22.2 is a popular choice. It consists of two kinds of friction static friction and dynamic 
friction. The latter occurs in the case when objects arc relatively sliding. 

Friction is very important for the motion to seem physically plausible for a human observer. The 
reason is that friction is an important paid of our everyday life. In fact we arc so used to friction that we 
often don’t even think about it. How should one go about modeling friction in a penalty-based simulator? 
It seems reasonable to use a spring-damper system for the friction force as well [Hasegawaet ah, 2003, 
Pedersen et ah, 2004]. This is done by tracking the contact points. 

At first point of contact, an anchor point, a, is saved. This is an unmovable point in the world 
coordinate system. Subsequently the friction is determined by the spring force stemming from the zero- 
length spring with spring coefficient k between the anchor point and the current position p of the contact 
point that caused the creation of the anchor point. 

E friction = k (a-p). (5.112) 

During the simulation we keep an eye on whether the following condition holds 

11 -^friction 11 T I 1 | -^normal 


(5.113) 
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Figure 5.21: Figure illustrating the idea of frictional springs. 

Here /t is the coefficient of friction, and -F n0 rmai is the normal force, where the penalty force often may be 
used as the normal force. If the condition is fulfilled we say that we have static friction and we do nothing, 
if, on the other hand, the condition is broken we say that we have dynamic friction, and the anchor point 
is moved such that the following condition holds, 

11 ^"friction 11 = p H-f 1 normal || • (5.114) 

The general idea is illustrated in Figure 5.21. The frictional spring force model can be physical justified 
as a crude model of the external shear stresses acting between two bodies [Lautrup, 2005]. Since it is the 
external shear stresses that we experience as friction in our everyday life. As such, the friction model adds 
an element of compliance to the rigid bodies. 

Another approach to model friction is simply to use (5.114) as used in [McKenna et al., 1990], i.e., 

II || Uf 

-^friction — F -^penalty 7j 17) (5.115) 

\\u t \\ 

where u t is the tangential relative sliding velocity. This approach only models dynamic friction, thus 
static friction is ignored. 
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5.9 Survey on Past Work 


Moore and Wilhelms [Moore et al., 1988] were among the first to model contact forces by springs in 
computer animation. They used a simple linear spring, but added a twist of letting the spring constant 
depend on whether the motion is receding or approaching. The relationship is as follows 


^-recede — £ ^-approach; 


(5.116) 


where e describes the elasticity of the collision, e = 0 corresponds to totally inelastic collisions, and e = 1 
to perfectly elastic collisions. 

Furthermore, Moore and Wilhelms extended the penalty method with an algebraic collision resolving 
method, similar to Newton’s collision law, as described in Section 6.1.4.1. The main idea is to handle 
colliding contacts before applying springs. This is because colliding contacts requires very stiff springs, 
which arc numerically intractable. 

Terzopoulos, et al. [Terzopoulos et al., 1987] used another kind of penalty force. The main idea 
behind their derivation comes from conservative forces, which arc known to be the negative gradient of a 
energy potential, i.e., the negative gradient of a scalar function, see Section 22.3. A scalar energy function 
is then designed, which penalizes penetration: 

E = cexp(^^j , (5.117) 


where c and e arc constants used to determine the specific shape of the energy potential. The penalty force 
is then 



(5.118) 


where n is the unit contact normal. Such penalty forces arc called exponential springs. Exponential 
springs arc stiffer for large displacement than linear springs. However, for small displacements exponen¬ 
tial springs arc less stiff than linear springs [McKenna et al., 1990]. 

Barzel and Bair [Barzel et al., 1988] used a constraint force equation. 


D 4 —D 4— t;D — 0, 


(5.119) 


where D is a function measuring the constraint deviation, i.e., the penetration depth, r is a specified time 
constant, used for controlling the rate of constraint satisfaction. Comparing with our damped harmonic 
oscillator, we see that 


2 b 


^ t m 

(5.120a) 



ol k 

= — = —• 

(5.120b) 

r m 


b 2 — Arnk = 0. 

(5.121) 


The criteria for critical damping is, 
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Dividing by m? gives, 


b 2 k 

, — 4 
m- m 


= 0 


7 2 — 4cjq = 0. 


(5.122a) 

(5.122b) 


Substituting the 7 = 2 , and to q = ^ we see that the model used by Barzel and Barr is critical damped. 
Furthermore the paper [ Barzel et al., 1988] formulates an inverse dynamics problem which handles simul¬ 
taneous constraints. 

In [McKenna et al., 1990], a twist on penalty forces is introduced to model collisions, inspired by 
Newton’s impact law, described in Section 6 .1.4.1, which relates pre- and postvelocities through a coeffi¬ 
cient of restitution, e, where a value of zero corresponds to completely inelastic collisions and a value of 
1 corresponds to fully elastic collisions. For interpenetrating contacts, moving away from each other, the 
penalty force is multiplied by e. 

if rfdepth < 0 and u ■ n > 0 then 

-^penalty — --^penally 

end if 


In Hirota et al. [Hirota et al., 2001, Hirota, 2002] a penalty force is integrated over the intersecting 
polygon and combined with an implicit finite element method. 

In Jansson and Vergeest [Jansson et al., 2002a], a mass-spring model is used for modeling both rigid 
and deformable objects, here simple springs are created when particles move within a nominal distance, 
and springs arc deleted again when their incident particles move further away than a given fracture dis¬ 
tance. The spring force is simply modeled as 


F = —l 


\Xi — Xj 


~I) 


Xj — Xj 
\Xi — x.j 


(5.123) 


where k, is the spring constant, x, and Xj arc the particle positions, and l is the nominal distance. Details 
for determining nominal and fracture distances can be found in [Jansson et al., 2002b]. 
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Impulse-Based Multibody Animation 


Impulse-based simulators [Hahn, 1988, Mirtich, 1996, Guendelman et al., 2003] simulates all physical 
interactions between the objects in the configuration by collision impulses. Static contacts such as one 
object resting on another arc modeled as a series of collision impulses occurring at a very high frequency. 
Except for static contacts, impulse-based simulators arc computationally effective for systems having 
many objects moving at high speeds. 

Constraint-based simulators, as described in Chapter 7, solve for the time-integral impulse in order 
to determine the final velocities of objects. This is not to be confused with the impulse-based paradigm 
described in this chapter, which uses collision impulses to model physics. To make the distinction clear - , 
we refer to these constraint-based simulators as velocity-based simulators, and simulators using collision 
impulses as impulse-based simulators. 

In an impulse-based simulator a one-sided-approach to advance the simulation time is typically used 
together with a sequential collision resolving method based on some incremental collision law. In this 
chapter, we will introduce the reader to these concepts and their theories. 

It is worth noting that impulses often need only to be applied at the closest points between two objects. 
The contact determination or spatial-temporal coherence analysis [Erleben, 2005], therefore may often be 
completely omitted from the collision detection pipeline. This greatly simplifies the implementation of an 
impulse-based simulator compared to a constraint-based simulator, as described in Chapter 7. 

We will start by studying the physics and mathematics behind a single point of collision, followed by 
an explanation of how general-purpose simulators can be built using only single-point collisions to model 
all physical interactions. 

6.1 Single-Point Collision 

A single-point collision means that if we have two objects, A and B, then they touch each other at exactly 
one point. We call this point p. We will assume that we have some method for computing a unique normal 
vector, n, at the point p. This is explained in detail in Chapter 14. We call this vector the contact normal. 
For objects with continuous smooth surfaces, the contact normal is parallel to the surface normals of the 
two objects at the point p. Note that under this condition, the two surface normals of the two objects are 
parallel but pointing in opposite directions. In the following, we will use the convention that the normal 
n always points from B toward A. The contact normal n and the contact point p define a contact plane 
consisting of all points that fulfill the equation 

Mx £ M 3 where n ■ x — n ■ p = 0 . ( 6 . 1 ) 

Alternatively, we can represent this contact plane by two orthogonal unit vectors 1 1 and t- 2 , both lying in 
the contact plane and both orthogonal to the contact normal. We will use the convention that these vectors 
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Figure 6.1: An example of a single-point collision with the origin of the World Coordinate System (WCS) 
indicated. 

form a right-hand coordinate system with the contact normal. That is 

n = 1 1 x t' 2 - (6-2) 

For 2D, we will simply drop the subscript on the tangent vectors and use t. Knowing the position and 
orientation of object A and B, we can compute the contact points w.r.t. for either one of these objects, that 

is, 

p A = r A + r c A , (6.3a) 

p B = r B + r c B , (6.3b) 

p A = p B . (6.3c) 

The superscript refers to the object, r cm is the position of the center of mass, and r is a vector from the 
center of mass to the contact position. The notation is illustrated in Figure 6.1. We’ll look at the contact 
point on body X, this is given by 

p x = r x +r c l , (6.4) 

using the body frame BFx, as described in Section 22.1.4, we can rewrite r x as 
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where R x is the orientation of the body frame of object X represented as a rotation matrix. Now using 
(6.5) in (6.4) we have 

P X = # A [r X ] BFx + r c x . (6.6) 

If we differentiate ( 6 . 6 ) with respect to time, we get the contact velocity of the contact point on body X, 
that is, 

P X = [ r X ] BF X + f 'cm • (6-7) 

The columns of R x are unit vectors along the axes of BFy- These are body-fixed vectors. So using what 
we know from (22.27) we can write the time derivative of a body-fixed vector as the cross product of the 
angular velocity and the body-fixed vector, that is ( 6 . 6 ) is rewritten as 


p x = R X [ r 

*1 +v x 

J BFx cm j 




(6.8a) 

p x =[u x 

X R x 

U X X 

R x \u x x R x 

1 \ r x ] +v x 

J L J BFx cm ’ 

(6.8b) 

p x =ut x x 

(« A ' [> 

* - Y 1 
JBFx. 

) + Wm, 



(6.8c) 

where we have introduced the notation r 

• x =v 

cm (. 

for the linear velocity of the center of mass. 

Now we 

do a mathematical trick of adding zero to (6.8c) and obtain 






( 


\ 



P X = 

X 

3 

R x [r 

X 1 +r x -r 

J BFx cm 

X 

cm 

+ v x 

1 ^ cm • 

(6.9) 



V 

P X 

J 



We recognize the (6.6) and reduce (6.9) to 






P X = 

= a; x x 

(p X ~r X )+v 

cm’ 


(6.10) 

using (6.4) we have 









r x = 

-P X ~ r Ym , 



(6.11) 

substituting into (6.10), we have 








P 

x = ^ 

x xr x +v c x . 



(6.12) 


We can now write a formula for the relative contact velocity, u, between the two objects A and B. 

u = p A — p B . (6.13) 

The relative contact velocity tells us something about how the two objects are moving relative to each 
other. For instance, if we look at the relative contact velocity in the normal direction, 

u n = n T (p A -p B ) , (6.14) 

B—. 
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then we know that 


relative contact = 


Separating 
< Resting 
Colliding 


if u n > 0 
if u n = 0 
if u n < 0 


(6.15) 


A separating contact means that in the future, no matter how small a time-step we look forward, the 
objects will no longer be touching at the contact point p. Resting contact means that the contact point 
p will persist in the future regardless of how small a time-step we take. Colliding contact means that if 
we do not take any counter action, such as applying a collision impulse at the contact point, then the two 
objects will penetrate in the future no matter how small a time-step we look ahead. 

We arc interested in the colliding case, thus we have derived an equation telling us when we need to 
apply a collision impulse to a single point of contact in order to avoid penetration. 


Theorem 6.1 (The Collision Test) 

Two bodies A and B are colliding, if, and only if, 


u n = n T ((u> A x r A + vj,) - (u> B x r B + v c f n )) < 0. (6.16) 


In the real world, a collision will not happen at a single point of contact, but it will occur over a small area. 
Actually, what happens is that when real objects come into contact, they deform slightly to accommodate 
each other’s shape. During this deformation the kinetic energy of the objects is transformed into internal 
elastic energy in the objects and some of the energy is dissipated as heat. When the kinetic energy reaches 
a minimum, the objects will have come to a stop relative to each other. We say that we have reached the 
point of maximum compression between the two objects. Now the buildup elastic energy is converted to 
kinetic energy. While doing so, the objects will begin to move away from each other and the deformation 
will reverse. This real-world model of a collision is depicted in Figure 6.2. The above description is still 
a very simplified view about what is going on in the real world. We have neglected many small details, 
such as plasticity of the deformation, fracture, and chemical and thermo-dynamical effects going on in the 
materials. However, for our purpose of computer animation, it suffices to understand the real world just a 
little better. 

In particular, we should note two things: first, the collision occurs over a small time duration; it starts 
at time t, and ends at time tf. Somewhere in between we have the time of maximum compression t mc . In 
particular, we know that for these times, we have 



/ 

ti 

= t > U n 

T 

= n 

0 p a 

- p b ) 

A 

o 

t = < 

tmc 

U n 

T 

= n 

(. p a 

- p b ) 

= 0 



= t > U n 

T 

= n 

0 p a 

- p b ) 

O 

A 


(6.17) 


Second, two phases take place, a compression phase where energy is built up in objects, then a restitu¬ 
tion phase where internal energy is released back as kinetic energy. As we will see later, most physical 
laws describe this process by a coefficient of restitution , which indicates the amount of energy that is 
transformed into kinetic energy during the restitution phase. 

From our everyday experience, we do not usually notice the deformation process taking place in 
what we understand as a rigid body collision. This is because the time-duration of the collision becomes 
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Compression Restitution 


Figure 6.2: A real-world collision consisting of two phases: a compression and a restitution phase. 

smaller and smaller the more rigid an object is. In the limiting case of an infinitely rigid object, we have 
a mathematical model of a rigid body. This model is often referred to as the rigid body assumptions. To 
make it perfectly clear, when those of us in physics, simulation or animation use the term rigid body, we 
assume the rigid body. There do not exist any such objects in the real world that actually behave according 
to these assumptions; however, the approximated behavior of many objects can be explained by them. 

Definition 6.1 (The Rigid Body Assumptions) 

During a collision, the following four assumptions apply to rigid bodies: 

• The duration of the collision is zero, that is tf — t, j —> 0. 

• Only impulsive forces must be used to avoid penetration. 

• Positions and orientations are the same before and after the collision. 

• Non impulsive forces have no effect during the collision. 

Definition 6.1 has many consequences. In practice it implies that there arc no wave “effects” in the 
collision, such as the deformation. We can ignore gravity and velocity-dependent forces such as Coriolis 
forces etc. 


Impulse is defined as the time integral of force. Given the net force, F, acting under a collision, the 
collision impulse is therefore 
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Note that the collision impulse has units of force x time. From Newton’s second law we have 


F 



J 

J 


dP 

dt 



P(t f ) ~ P(U ), 

A P. 


(6.19a) 

(6.19b) 

(6.19c) 

(6.19d) 


In the above, we have only considered linear motion, but it is clear that the collision impulse corresponds 
to a change in the momentum of a rigid body. Since P = mv cm , we see that the impulse is equivalent to a 
change in the velocity of the rigid body, when the mass of the rigid body is considered to be constant. The 
exact relationship between impulse and change in rigid body velocities is derived in Theorem 6.2. From 
Newton’s third law it follows 

Ja = ~Jb■ ( 6 . 20 ) 


Mathematically it is difficult to work with (6.18), when tf — ti —> 0. In order to circumvent this problem, a 
reparameterization is used when deriving equations. Any legal parameter 7 must by monotonically strictly 
increasing, that is, 



( 6 . 21 ) 


This requirement comes from the equivalent real-world model, where time always runs forward, never 
pauses, or stops. If we have a 7 parameter with the same property, then there is a one-to-one correspon¬ 
dence between the 7 -parameter and the time-parameter. Therefore, it does not matter with which one we 
choose to parameterize our motion. Looking at a real-world collision, we know that the impulse in the 
normal direction has 


dJ n 

dt 


> 0 . 


( 6 . 22 ) 


This means for any legal parameter 7 the following condition must also hold 


dj 

dJ n 


> 0 . 


(6.23) 


This follows from the same argumentation that we used to explain the requirement in (6.21). 

In order to compute an impulse, we need a method called a Collision Law. There are basically four 
different kinds of Collision Laws: 


• Algebraic Collision Laws 

• Incremental Collision Laws 

• Full Deformation Collision Laws 

• Compliant Contact Model Collision Laws 
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Algebraic laws arc characterized by the fact that they consist of an algebraic equation. Given the initial 
conditions of a collision, we immediately know the collision impulse by evaluating an algebraic equation. 
Therefore, algebraic laws arc computational inexpensive and for the same reason, they arc attractive in 
computer games and computer animation. They do, however, lack some realism, which is captured by the 
three alternative collision laws. 

Incremental laws, arc characterized by having some microscopic contact model, described as an ordi¬ 
nary differential equation. The resulting impulse is then computed by integration over the compression 
and restitution phases. Incremental laws arc computationally more expensive than algebraic laws, but they 
produce more realistic results. For an example of an incremental law we refer the reader to [Mirtich, 1996]. 

Full-deformation laws arc properly the computationally most expensive of all the four kinds of col¬ 
lision laws. Full-deformation laws arc based on continuum mechanics equations for the complete body, 
such as lineal - elasto-dynamic equations. Here a partial differential equation is solved using appropriate 
material properties. Full-deformation laws are not really usable in computer animation for two reasons: 
first they are too computationally expensive and second, it is impossible to determine the initial conditions 
for the multitude of material properties that need to be specified. Thus, algebraic laws are prevalent and in¬ 
cremental laws are used to some extent. See [Chatterjee et al., 1998] for references about full-deformation 
laws. 

Finally, there are compliant contact model laws, which we mention for completeness only. These 
collision laws can be thought of as a kind of spring-damper system, which describes the deformation 
process that takes place during a real-world collision, similar to the penalty-based methods described in 
Chapter 5. One has to determine stiffness and damping terms for this kind of collision law. Figure 6.3 
illustrates a compliant contact model. We refer the interested reader to [Kraus et al., 1997] for more 
details. The theme of this book is animation, therefore we will limit ourselves to treat only algebraic 
collision laws. The general recipe for a collision law consists of combining a contact model with one or 
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more physical laws. 

Definition 6.2 (Collision Law) 

A collision law is 

Collision Law -7= Physical Law + Contact Model (6.24) 

Physical laws arc often used in one of two ways: either as stop criteria in an incremental law or to eliminate 
unknown variables in an algebraic law. Usually there arc also some assumptions, such as the impulse is 
only pointing in the contact normal direction or something similar. 

In computer animation we arc interested in: how fast can we compute an impulse? How difficult is 
it to set up the collision law? How realistic arc the effects a collision law can give? In the following, we 
will derive theory and tools that can help the reader evaluate these three questions, and finally we will end 
with some examples on typical used algebraic collision laws. 

6.1.1 Impact and Friction Laws 

Four physical laws arc often encountered in collision laws: Newton’s impact law, Poisson’s hypothesis, 
Stronge’s hypothesis, and Coulomb’s friction law. 

Newton’s impact law defines a simple linear relation between the initial relative contact normal veloc¬ 
ity, u n {pfi), and the final relative contact normal velocity, u n (yj), 

u n (lf) = -euni'ri), (6.25) 

where e is the normal restitution coefficient with values limited to the interval 

0 < e < 1. (6.26) 

The restitution coefficient can be thought of as a measure of bounce. A value of one corresponds to a 
complete elastic collision. That is, if a ball is released over a table, the ball will bounce back from the 
table and jump back to the same height it was released from, before falling back down toward the table. 
This bouncing up and down will continue forever. If the value is set to zero, then the ball will hit the table 
in a perfect inelastic collision, that is, the ball will hit the table bluntly and just stop moving upon impact. 
This is perhaps best described as a dead ball. 

In practice, neither setting the coefficient of restitution to zero nor one yields very realistic behavior. 
However, some algorithms use these extreme cases to model certain aspects. Also these two extreme cases 
are very useful when testing a simulator. 

Newton’s impact law is considered the most simple physical law that describes the effect of a rigid 
body collision. It is commonly used in computer games, but when combined with friction it suffers 
from serious problems, such as a breakdown of energy conservation. That is, adding frictional impulses 
according to Coulomb’s friction law could result in energy gain. 

Poisson’s hypothesis describes a linear relationship between the normal compression impulse J n (7mc) 
and the normal restitution impulse J n {lf) — Jn(7 mc) 

’Jnipf) Jn{ 7mc) — &Jn{, 7mc)> (6-27) 
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where e is the normal restitution coefficient with values limited to the interval 


0 < e < 1. (6.28) 

By the normal impulse, we mean the size of the projection of the impulse onto the contact normal. That 
is, J n ( 7) = n ■ J( 7). Poisson’s hypothesis is a little more advanced than Newton’s impact law, however 
it suffers from the same problem. Energy conservation breaks down when friction is added. 

Stronge’s hypothesis describes a quadratic relationship between the work done by the normal impulse 
during the compression phase W n ( 7 mc ) an d the work done by normal impulse during the restitution phase 

h u 

w n (yf) - W n (p/mc) = -e 2 W n (J mc ), (6.29) 

where e is the normal restitution coefficient with values limited to the interval 


0 < e < 1. 


(6.30) 


The work done by the collision force F over the duration of a real-world collision is by definition work 
(see Section 22.3): 


W = 


rtf 


F(t) ■ x(t)dt. 


Reparameterization gives us a workable model applicable under the rigid body assumptions 

W = F(l) ■ (^) dt - 


(6.31) 


(6.32) 


According to Newton’s second law F( 7) = 7), and without loss of generality, we assume 7 * = 0. 

Furthermore, *(7) is the velocity of the contact point, thus with respect to body A we have ±(7) = ua (7). 
Therefore the work done on object A is 


ns d 

Wa= u a {t) ■ -r J( Ci)dl- 

Jo “7 


Similarly, the work done on object B is 

W B = ~ 

Since u = ua — ub, we end up with the total work 


7 / d 

« b ( 7) • (7)^7- 

dy 


ns d 

W= u(y) ■ —J(y)dy. 
Jo “7 


(6.33) 


(6.34) 


(6.35) 


Taking the dot product n ■ J (7) gives the amount of work done in the normal direction. When we have 
derived the Impulse-Momentum Law in Section 6 .1.2 the work (6.35) can be written in terms of the relative 
contact velocity only. 
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To our knowledge, Stronge’s hypothesis is considered the most accurate physical law describing a 
rigid body collision today. The major benefit of Stronge’s hypothesis is that energy conservation holds 
when friction is added. This means in terms of physics, it is superior to both Newton’s impact law and 
Poisson’s hypothesis. 

Coulomb’s friction law describes a relationship between the frictional force and the magnitude of the 
normal force. A more detailed description of Coulomb’s friction law can be found in Chapter 22.2. If we 
know the force of the normal direction, F n , and the relative velocity in the contact plane u t is nonzero, 
then we have dynamic friction, also called sliding friction. In this case the friction force F t is given by 

Ft = ~bir~~u ll F n||, (6.36) 

where // is the coefficient of friction. If the relative velocity in the contact plane is zero then we have static 
friction ; also called dry friction. We distinguish between two cases of static friction: stable and unstable. 
If the time derivative of the contact velocity in the contact plane is zero, that is 

u t = 0, (6.37) 

then we have stable static friction and the magnitude of the friction force is given as 

\\F t \\ <p\\F n \\. (6.38) 

However, unlike dynamic friction, we cannot say anything about the direction. If, on the other hand, 

'ii/ f 0, then we have unstable static friction and the friction force is given by 

\\F t \\ = n\\F n \\ and F t u t < 0. (6.39) 

Thus, in this case, we know that the friction force obtains its maximum magnitude and that it opposes the 
relative acceleration in the contact plane. However, the friction force is still not uniquely determined. The 
name “unstable” means that a transition can occur either to dynamic or stable static friction. 

It should be mentioned that the coefficient of friction often has different values under static and dy¬ 
namic friction. In computer animation, this is often overlooked and the same value is used for both the 
static and the dynamic case. 

6.1.2 Contact Model 

From Newton’s second law we have 

F A (t) = m A a c A ( 7), (6.40) 

and from Euler’s equation we have 


T A ( 7 ) = r A x F A (y) = I A a A (y) + u A (y) x I A u A (7). 


(6.41) 
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The term u A ( 7 ) x I A u> A ( 7 ) can be ignored during collision, since it is only dependent on velocity 
terms. So we have the more simple equations 

F © 7 ) = m A a c A (y), (6.42a) 

r A xF A ( 7 ) = I A a A ( 7 ). (6.42b) 


Integrating from 7 * to 7 < 7 /, we have 


F A (l)dl= / rn A a c A ('-y)d , y, 


’a 


’n 



x F A (y)d 7 = / A {l)d'y. 


"yi 


(6.43a) 

(6.43b) 


Using the definition of impulse (6.18) on the left-hand side and noticing that mass terms on the right-hand 
side are constant, we have 


J © 7 ) = m A (v c A (y) - -u c A ( 7 i)), (6.44a) 

Av cm(l) 

r A x J A ( 7 ) = I A (w © 7 ) - w A ( 7i )) • (6.44b) 

'- v -' 

AccJ ^(7) 

By the rigid body assumptions, we can disregard any integration constants. These will be finite and thus 
insignificant during a infinitesimal time duration. Now we have a simple relation between the collision 
impulse and the changes in linear and angular velocities 

J a ( 7 ) = m A Av c A ( 7 ), (6.45a) 

r A x J A (j) = I a Au A ( 7 ). (6.45b) 

Rewriting, we end up with a couple of equations telling us how to compute the effect of an impulse on a 
rigid body. 

Theorem 6.2 (Applying Impulse to a Rigid Body) 

Applying impulse J to a rigid body changes the linear velocity v and the angular velocity u>, with the 
amounts: 

Av = — , (6.46a) 

m 

Auo = I 1 (rxJ). (6.46b) 

We have omitted the 7 -parameter and .4-object label in Theorem 6.2 to make it more readable. 

Previously from ( 6 .12), we know how to compute the contact-point velocity on object A 

P ©7) = A irt) x r A + v c A (j). (6.47) 
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Therefore, the change in contact-point velocity must be 

Ap' 4 (7) = Aa)' 4 (7)xr' 4 + A^(7), (6.48) 

substituting (6.46) for A uj ' and A-o^, we have 

A p a ( 7 ) = (I a 1 ( r A xJ A (y))) x r A (6.49) 

rriA 

using the cross product matrix (18.65) and letting 1 be the identify matrix, we have 

Ap A ( 7 )= (^-1-(r A ) x I^(r A ) x ) J a ( 7 ). (6.50) 

If we do the same thing for object B, then we have 

Ap B h)= Q-l -(r B ) x I^(r B ) x ^J B ^). (6.51) 

Recall that J B = — J A so 

A p B (y) = - (~ 1+ ( r ' B ) X/ B 1 ( rB ) X ) jA (7)- (6-52) 

Using (6.13), we can write the change in relative contact velocity as 

A«( 7 ) = A p a ( 7 ) - A p b ( 7 ), (6.53) 

substitution of (6.50) and (6.52) into (6.53), we have 


Au(7) = ((i + 1 - + / B /r B '(r B ) x )) j a (A (6.54) 

-----V- 

K 

We have derived the algebraic form of the impulse-momentum relation. 

Theorem 6.3 (The Impulse-Momentum Relation) 

The change in relative contact velocity between two rigid bodies is given by 

Au( 7 ) = KJ(y), (6.55) 


where K is the collision matrix and is given by 
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In this form the impulse-momentum relation is often used as the contact model for algebraic collision 
laws. 

Theorem 6.4 (Properties of the Collision Matrix) 

The collision matrix K is 

• constant 

• symmetric 

• positive definite 

• invertible 

Properties of the Collision Matrix: 

Looking at (6.56) we see that the collision matrix is constant because all terms in this equation arc constant 
before, during, and after a collision according to the rigid body assumptions in Definition 6.1. 

Looking closely at (6.56) we see that it consists of the summation of three kinds of matrices, that is, 
we can write the collision matrix as 


K — M + A a + A B , (6.57) 


where 


M = (— + —) i. 

(6.58a) 

\m A rn B J 

lb- 

II 

1 

'T 

Ju 

X 

Ju 

X 

(6.58b) 

A B = -(r B ) X I B \r B )\ 

(6.58c) 

Looking closely at the types (6.58b) and (6.58c), we see that the transpose of Ax is 

A = -((<- x )A 1 (’- x ) x ) t . 

(6.59a) 

= -((r X ) X ) T dA T ((r X ) X ) T , 

(6.59b) 

X 

l-H 

X 

1 

II 

(6.59c) 

= A x . 

(6.59d) 


The last step in the derivation follows from the fact that the inertia tensor is a symmetric matrix and the 
cross product matrix is skew symmetric. In conclusion, the Ax-matrices arc symmetric. The iC-matrix 
consists of the sum of three symmetric matrices and is therefore itself symmetric. 
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Now we will show that A x -matrices arc positive semidefinite. Given any v f 0, we have 


v 


A x v = v T (^ — {r x YIx ( rX ) X ) v > 

= -v T (r X ) X I~ x 1 (r X ) X v, 


= (r X Yv I x ' (r x Y 


v 


= w 1 1 x ] w > 0 . 


(6.60a) 

(6.60b) 

(6.60c) 

(6.60d) 


The inertia tensor is known to be positive definite, that means for any vector w f 0, we have w 1 / v ' w > 
0, if w = 0, then obviously w 1 If 1 w = 0. Thus, we realize that 

v t A x v = w T If^w > 0 , (6.61) 


which is the definition of a positive semidefinite matrix. In conclusion, the Ax-matrix is positive semidef¬ 
inite. 

This means that TGmatrix is the sum of a positive definite matrix AT-matrix and two positive semidef¬ 
inite matrices A x and Ab, from which we conclude that the If-matrix is positive definite. And since K 
is positive definite it is also invertible. □ 


From (6.55), we have 


Au(y) = KJ(j), 
u(j) ~ u(ji) = KJ(j). 


(6.62a) 

(6.62b) 


If we differentiate with respect to 7 , then we get 

Y b) - - k Y 

= © J(7) - 

because and K arc constants. Since K is invertible, we have 


(6.63a) 

(6.63b) 


-r- J ( l) = K (6.64) 

07 07 

These arc the differential forms of the impulse-momentum relation. 

Theorem 6.5 (Differential Form of Impulse-Momentum Relation) 

Given two objects, the impulse-momentum relation can be written in differential form as either of the 
equations 


_ k-' Au b) . 


(6.65) 

( 6 . 66 ) 
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This form is often used as the contact model in incremental laws. Note that substitution of ( 6 . 66 ) into 
(6.35) yields 

W = f -itfy) • K^ 1 — u(y)dy, (6.67) 

Jo 07 

thus it becomes an integral only in the variable u( 7 ). 


6.1.3 The Permissible Region in Impulse Space 

Now we will start to describe the region in impulse space, where collision impulses yield physically 
plausible results. For this we will need two reference impulses and a definition of kinetic contact energy. 
Before doing so, we will introduce the shorthand notation Ui = u{pfi ) and Uf = u(yj). 

Theorem 6.6 (Plastic Sliding Impulse) 

A perfect plastic sliding impulse is given by 

( n T Ui \ 

Jl = - \wTctl) "■ l6M> 

The relative normal contact velocity is zero and tangential contact velocity is unchanged when this impulse 
is applied. 

Plastic Sliding Impulse: 

Substitution of ( 6 . 68 ) into (6.55) yields 


Uf = K.J 1 + Ui, 


= K 


( —n T u; \ 


\n T Kn ) 


n + Ui 


We’ll take the dot product with n, that is 


Uf • n = ( K 


( —n T Ui \ 




(6.69a) 

(6.69b) 


(6.70a) 


T 

= — n Ui 
T 

= — n Ui 


(.Knf 


n 


n T Kn 
T Kn 


+ Ui ■ n, 


n 


n T Kn 


+ Ui ■ n = 0 . 


(6.70b) 

(6.70c) 


In the last step we used that FF-matrix is symmetric. From the above, we see that the J/ impulse results 
in a final zero relative contact normal velocity as wanted. Now we’ll take the dot product with a contact 
plane tangent vector, t 


Uft = 



—n T Ui \ 

n T Kn J 


= Ui ■ t. 


n^t+Ui ■ t, 

0 


We see that the relative contact velocity in the contact plane is unchanged by the impulse J j. 


(6.71a) 

(6.71b) 

□ 
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J, 

\ 


Before 


After 


Figure 6.4: Perfect plastic sliding impulse makes normal relative contact velocity, but leaves tangential 
relative contact velocity unchanged. 


A perfectly plastic sliding impulse is illustrated in Figure 6.4. 

Theorem 6.7 (Plastic Sticking Impulse) 

A perfect plastic sticking impulse is given by 


Ju = —K l Ui. (6.72) 

The relative normal contact velocity is zero and so is the tangential contact velocity after applying this 
impulse. 

Plastic Sticking Impulse: 

Substitution of (6.72) into (6.55) yields 


Uf = KJ ii + Ui, (6.73a) 

= K ( -K~ 1 u i ) + Ui , (6.73b) 

= —KK l Ui + ttj, (6.73c) 

= — Ui + Ui = 0. (6.73d) 

That is, the final relative contact velocity is zero, which is what we wanted to show. □ 


A perfectly plastic sticking impulse is illustrated in Figure 6.5. We can define a quantity called the kinetic 
contact energy as follows. 
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\ _ 

- Before 


After 


Figure 6.5: Perfect plastic sticking impulse makes relative contact velocity zero. 


Definition 6.3 (The Kinetic Contact Energy) 

Given the relative contact velocity u and the collision matrix K, the kinetic contact energy E c , is defined 
as 

E c = ^ujK~ 1 Uf . (6.74) 

The kinetic contact energy looks nothing like the kinetic energy, as described in Section 22.3, which is 

Ekin = ^m A v c A ■rj + i (w A ) T I A w A + \m B v c B ■ v c ^ + ^ (w B ) T I B u B . (6.75) 

In fact, it is quite trivial that E c f E^ m , however, we arc interested in measuring changes in energy, not 
absolute values. It turns out that a change in the kinetic contact energy is equal to a change in the kinetic 
energy. Thus, the two types of energy arc equivalent except for a constant offset. 

Theorem 6.8 (Change in E^m is equal to change in E c ) 

If the change in relative contact velocity is A u, then the change in kinetic contact energy 

A E c = ^A u 1 K~ 1 Au, (6.76) 

and the change in kinetic energy AE kin are equal, that is, 

A Ekin = A E c , (6.77) 


as mentioned in [Chatterjee et al., 1998], 
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From the impulse-momentum relation (6.55), we have 

3 

1 

II 

(6.78a) 

J = K~ l u- K~ l Ui, 

(6.78b) 

J = K 1 u + «7n, 

(6.78c) 

J -Jr 7 = K~ l u, 

(6.78d) 

u = K(J- J n ). 

(6.78e) 

Inserting (6.78e) into (6.76) yields 

E C = \{K(J- Jn)) T K 1 (K (J - J H )), 

(6.79a) 

E c = \{J- J n ) T K(J- J H ). 

(6.79b) 


Now we have an expression for the kinetic contact energy, E c , in impulse space. 

Theorem 6.9 (The Kinetic Contact Energy in Impulse Space) 

Given the collision impulse J between two objects, the kinetic contact energy is given by 


E C = \{J -Jn) T K(J -J n ). (6.80) 

The impulse J n works as a reference point for kinetic contact energy. 

Image J = 0, then inserting into (6.80) gives 

E c = —J\\K J\\, (6.81a) 

= ^(K 1 u i f KK l Ui , (6.81b) 

= ^ujK 1 u i = E i . (6.81c) 

We have discovered the initial kinetic contact energy E. t , that is, the available amount of energy. It would 
be very unphysical if E c ever were larger than E t , since it would indicate an energy gain coming from 
nowhere. 

Theorem 6.10 (The initial Kinetic Contact Energy) 

The initial kinetic contact energy, E j, is given by 

E i = l;ujK i u i . (6.82) 

If we keep a fixed value of energy E, and try to find all vectors J yielding that energy, we will find an 
elliptical surface in impulse space. This follows from the fact that Tf-matrix is positive definite, as was 
proven in Theorem 6.4, and Proposition 18.11. 

B—. 
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Figure 6.6: Elliptic level set surface corresponding to the initial kinetic contact energy. 


Theorem 6.11 (Elliptical Kinetic Contact Energy Level Set Surface) 

Given a fixed value of kinetic energy E, all impulse vectors J yielding this energy lie on an elliptical 
surface in impulse space. 

The fact that a given energy level is an ellipsoid is very useful for a geometric interpretation of a collision. 
We can simply draw elliptic level set surfaces for fixed values of E. In particular, E r is interesting because 
if it has an impulse outside this initial energy ellipsoid, it indicates an energy gain, which is unphysical. 
Figure 6.6 illustrates the initial energy ellipsoid in two dimensions. Notice that the impulse J = 0 lies on 
the level set surface of E t and the impulse J\\ lies at the center of the ellipsoid. It is easily verified that the 
impulse 2 J\\ also lies on E r 


E c = - (2Jn - Jn) T K (2Jn - Jn) 


= -(K 1 u i ) T KK l Ui , 


-ujK l Ui = E t . 


(6.83a) 

(6.83b) 

(6.83c) 

(6.83d) 
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Figure 6.7: Given impulse J, resulting relative contact velocity from applying that impulse is equal to 
normal of energy level set surface. 

Given any collision matrix K, one can easily draw the initial energy ellipsoid by computing J n, then 
perform an eigenvalue decomposition on K. The eigenvectors will yield the orientation of the axes of the 
ellipsoid, and the eigenvalues the scalings along these axes (see Proposition 18.12). This implies that one 
draws a unit sphere, scales it by the eigenvalues of the FT-matrix, uses the eigenvectors of the TT-matrix 
as columns in a rotation matrix, applies the rotation to the sphere, and finally performs a translation by the 
impulse J\\. 

We’ll try to compute the normal of an energy level set surface E c , 

VE C = K (J — J n ), (6.84a) 

VE C = KK V (6.84b) 

VE C = u. (6.84c) 

So given an impulse J we can find the resulting relative contact velocity by looking at the normal at the 
point of the level set surface where J touches. This is illustrated in Figure 6.7. 

Theorem 6.12 (The Normal of the Elliptical Kinetic Contact Energy Surface) 

Given an impulse J, resulting in the kinetic contact energy E c , the resulting relative contact velocity is 
given by the normal to the elliptical surface corresponding to E c at the point where J touches the level 




Figure 6.8: Line of sticking, corresponding to impulses that results in zero final relative tangential contact 
velocity. 

set surface. That is 

u = VE C . (6.85) 

Theorem 6.12 gives us another powerful tool for interpreting the effect of certain impulses by geometric 
constructs. In the following, we will apply this tool to find a line of sticking and a plane of maximum 
compression. 

We define a line running through the point J\\ in impulse space and crossing all the kinetic energy 
level set surfaces at the two points where their normals are parallel to the contact normal. Every impulse 
that lies on this line yields a resulting final relative contact velocity with no components in the contact 
plane. This implies that the two objects will not move relatively in the tangent plane after the collision. In 
other words, they are sticking. We therefore call the line: line of Sticking. 

Theorem 6.13 (Line of Sticking) 

The line of sticking runs through the point J // and cuts the energy level set surfaces where their normal is 
parallel with the contact normal. 

The line of sticking is illustrated in Figure 6.8. Observe that on the line of sticking we always have Uf || n. 
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In similar fashion to the line of sticking, we define a plane in impulse space. The plane has the point 
Jii lying on it, and every point belonging to the plane results in final relative contact velocity that is 
orthogonal to the contact normal. We can write this as 

n T u = n T ( KJ + Ui) = 0 . ( 6 . 86 ) 

Looking at a specific energy level set surface in 3D, all points on the surface having a normal orthogonal 
to the contact normal lie on an elliptical circle and the circle is planar. According to (6.15), we call this 
plane the Plane of Maximum Compression. 

Theorem 6.14 (Plane of Maximum Compression) 

The plane of maximum compression is given by all points in impulse space where the normal of the kinetic 
contact energy level set surfaces are orthogonal to the contact normal. 

Note that in 2D, the plane of maximum compression is a line and not a plane. Another property of the 
plane of maximum compression is that it corresponds to impulses where the coefficient of restitution is 
zero. 

Theorem 6.15 (Zero Restitution) 

For all points J on a plane of maximum compression , the coefficient of restitution e, has the value 

e = 0. (6.87) 

The plane of maximum compression is illustrated in 2D in Figure 6.9. The impulse J/ must lie on the 
n-axis and it also must lie in the plane of maximum compression, so we can draw J j where the n-axis 
intersects the plane of maximum compression. The impulse 2 J j also lies on the n-axis and it conserves 
energy, so we can draw it where the n-axis crosses the initial kinetic contact energy level set surface, E{. 
Note the coefficient of restitution is one for 2 Jj. The two impulses J j and 2Jj are shown in Figure 6.10 
Observe that both J/ and J\\ lie in the plane of maximum compression. 

Finally, we will interpret the physical laws in terms of geometry in the impulse space. First there 
should not be any energy gain, this is equivalent to 

E c < Ei ( 6 . 88 ) 

and implies that any resulting collision impulse must lie inside or on the ellipse given by E j. Second, 
collision impulses cannot attract objects to each other; they must be repulsive. This implies that the 
collision impulse must not point in the opposite direction of the contact normal, that is 

n T J > 0. (6.89) 

Geometrically, this means that collision impulses are restricted to lie above or on the contact plane. Third, 
the collision impulse should result in a final relative contact velocity, such that the two objects will not 
penetrate, that is, we require nonpenetration, which means that 

n T Uf > 0. (6.90) 
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Figure 6.9: Plane of Maximum Compression. This corresponds to zero restitution. 


This implies that the impulse must lie on or above the plane of maximum compression. Finally, Coulomb’s 
friction law dictates that the impulse should be inside or on the cone defined by 

\\J — (n ■ J) ■ n\\ < p (n • J). (6.91) 

That is, the cone with apex at the origin and sides with slope equal to j r This cone is called the friction 
cone. Applying these four conditions, we end up with a region in impulse space. The region describes the 
set of all impulses, which make physical sense to apply in a single-point collision. This permissible region 
in impulse space is illustrated in Figure 6.11. We can learn several things from the permissible region 
of impulse space; first it is a closed 3D space, in fact it is a spherical bounded cone, intersected by the 
contact plane and the plane of maximum compression. To parameterize the permissible region requires 
three parameters. This is remarkable because most collision laws in use only have one or two parameters, 
which means that they cannot reach the full domain of the permissible region. Often they arc limited to a 
line or plane in the permissible region. 

Another observation is that it is possible to have a restitution value larger than one, without being in 
conflict with the physics. This is shown in Figure 6.11, the impulses 2 J / and 2Jn have e = 1. In fact, all 
impulses lying on the line between 2 J/ and 2J\\ have e = 1. This means that impulses lying above this 
line have e > 1. It is assumed that coefficient of restitution is a measure of how much the relative normal 




Figure 6.10: The two impulses J/ and 2 J j. 
contact velocity is reversed after the collision. 

The geometrical interpretation of physically valid impulses can be used to correct physical invalid 
collision impulses, simply by projecting these back onto the closest point on the permissible region. 

6.1.4 Examples of Algebraic Laws 

Now we will derive three examples of simple algebraic collision laws: Newton’s collision law, a two- 
parameter frictional law, and a frictional version of Newton’s collision law. 

6.1.4.1 Newton’s Collision Law 

We assume that the collision impulse is parallel with the n-axis, that is 

J = jn, (6.92) 

where j is the magnitude of the collision impulse. We will also assume that only the relative contact 
velocity in the normal direction changes. From Newton’s impact law we have 


«n(7/) 


eUn^i). 


(6.93) 
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Figure 6.11: The permissible region in impulse space. 


So the change in relative contact velocity in the normal direction can be written as 


Au ■ n = Uniif) - Unin), (6.94a) 

Am • n = -eu n (ji) - u n (n), (6.94b) 

Am • n = —(1 + e)ui ■ n. (6.94c) 

Looking at the impulse momentum relation we have 

Am = Kjn. (6.95) 


We arc only interested in the normal direction, so we take the dot product with the contact normal 


Am • n = (Kjn) ■ n, 

Au ■ n = n 1 Knj, 

isolating j we get Newton’s collision law 

. _ -(1 + e)ui ■ n 
^ n 1 Kn 

After we compute j by (6.97) we insert into (6.92) to find the resulting impulse J. 


(6.96a) 

(6.96b) 


(6.97) 
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6.1.4.2 A Two-Parameter Frictional and Energy Conserving Law 

Now we will look at collision law proposed in [Chatterjee et al., 1998]. It is simple, fast, and includes 
friction as well as reversible impacts. Furthermore, it is guaranteed to be in the permissible region of 
impulse space. In the Collision Law, one first computes an impulse J according to the equation 

J = (1 + e n ) J i + (1 + et) (Jn — J i ), (6.98) 


the coefficient e n is the coefficient of normal restitution and et is the coefficient of tangential restitution. 
The normal restitution coefficient tells us something about how elastic the collision is in the normal direc¬ 
tion. A value of zero means completely inelastic and a value of one means totally elastic. The tangential 
restitution coefficient tells us something about how much the tangential velocity is reversed. A value of — 1 
will completely remove any tangential effects, turning J into a newton impulse. A value of 1 corresponds 
to full tangential velocity reversal. 

Here the impulse J / and Jn arc both perfectly elastic collisions in the normal direction. The first 
does not affect the tangential direction, whereas the second brings the two bodies into sticking contact in 
the tangential direction, as can be seen from Theorems 6.6 and 6.7. For convenience, we will repeat the 
equations for the impulses. J j is computed as follows 


and Jn as 

Jn = —K~ 1 u. 


(6.99) 

( 6 . 100 ) 


As a final step in the Collision Law, it is tested if the impulse J lies outside the friction cone. This can be 
done with the following test: 

|| J — (n ■ J) n|| > /rn • J. (6.101) 

If the impulse lies outside, then it is projected back onto the friction cone by computing k. 


/a (1 T e n ) n • Ji 

J ii ~ (n- Jn) n|| - fin ■ (Jn - J/) ’ 


( 6 . 102 ) 


and finally the impulse J is computed according to 


J — (1 + e n ) J / + n (J n — J j). 


(6.103) 


6.1.4.3 Newton’s Collision Law with Lriction 

The next algebraic law we will describe is based on [Guendelman et al., 2003]. First we will assume 
sticking, that means for the final tangential relative contact velocity u tf = and the normal contact 

velocity u rif = u n ('if), we require that 


Ut f = 0, (6.104a) 

u n = —eu n {p/i)n. (6.104b) 
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Now we can compute the change in relative contact velocity A u = Uf — u t , and from impulse-momentum 
relation (6.55), we have 



3 

1 

•*-, 

i 

II 

l 's 

(6.105) 

Now we compute the normal and tangent components of the impulse, that is 



Jn — ^5 

J t = J J n • 

(6.106a) 

(6.106b) 

If 

|| Jt || ^ || Jn || i 

(6.107) 

then the impulse J is in the friction cone and we can use it. Otherwise we need to consider sliding 
friction. To consider sliding friction, we first determine the direction of sliding, that is, first we compute 
the tangential component of the initial relative contact velocity 


U ni = (ui ■ n) n, 

'U'ti — M Ili . 

(6.108a) 

(6.108b) 

Now the direction of sliding can 

be computed as the unit vector 



Uf 

t = „ ** 

II “till 

(6.109) 

and we define an impulse as 

J = jn- 

( 6 . 110 ) 

where j is the magnitude of the normal impulse. If we take dot product of the impulse-momentum relation 
and the contact normal we get 

u n j. — 'U'ni 4~ ^ IN J . ( 6 . 111 ) 

Using Newton’s impact law 

c'urii — d - tt K J , 

( 6 . 112 ) 

substitution of ( 6 . 110 ) yields 

-eu ni = u ni + n 1 K (jn- njt ). 

(6.113) 

Finally, we can solve for j 

- (1 4- e) u ni = n T K (jn - njt ), 

- (1 + e)u Ui = n Knj — n Kt^ij, 

- (1 + e) u ni = n 1 K (n - t/r) j, 

-(1 + e) u ni 
n T K (n — fit) 

(6.114a) 

(6.114b) 

(6.114c) 

(6.114d) 


Knowing j, we can insert it into (6.110) and compute the impulse J. 
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Figure 6.12: The contact force or impulse equivalence principle. The contact region on the left is replaced 
by a discrete finite set of points, consisting only of vertices lying on the convex hull of the contact region. 


6.2 Multiple Points of Collision 

The most interesting simulations in computer animation involve pairs of objects that touch each other at 
more than a single collision point. Even a simple cube falling flat onto a table top, has infinitely many 
points of contact on the face touching the table during impact. It seems that all the theories we have from 
single-point collisions arc useless. 

To avoid the problem of having infinitely many contact points, we can use the contact force/impulse 
equivalence principle. 

Lemma 6.16 (Contact Force and Impulse Equivalence Principle) 

Equivalent forces or impulses exist such that when they are applied to the convex hull of the contact region 
the same resulting motion is obtained, which would have resulted from integrating the true contact force 
or impulse over the entire contact region. 

The principle is illustrated in Figure 6.12. This is a very useful principle, especially in computer animation, 
where contact regions often arc polygonal areas. With this principle we have reduced our problem of 
infinitely handling many contact points to having to handle a finite discrete set of contact points only. 

In a simulator there arc generally two basic types of methods for handling multiple points of collisions: 

• Sequential Collisions 

• Simultaneous Collisions 

The names may be a little misleading. There is no concept of time propagation, as there is in compliant 
contact models [ Kraus et al., 1997], The names reflect how impulses arc computed or propagated at a 
single instance in time. The sequential collision approach is sometimes referred to in the literature as 
propagating impulses. The task of computing equivalent contact impulses and applying them to the rigid 
bodies is called collision resolving. 
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algorithm simple-sequential(C : Contacts) 
while collision 

collision = false 
for each c £ C 

if cis colliding then 
Resolve collision at c 
collision = true 
end if 
next c 
end while 
end algorithm 


Figure 6.13: Pseudocode of simple sequential collision resolver. 


We cannot really talk about one method being more correct or better than the other because it is 
possible to set up physical configurations of colliding rigid bodies where one of the methods computes the 
wanted motion and the other does not and vice versa [Mosterman, 2001]. 

Most simultaneous collision methods arc based on complementarity conditions at the velocity level, 
however it has been found that complementarity conditions at the velocity level have no fundamental phys¬ 
ical basis for collisions of even extremely stiff objects [Chatterjee, 1999]. We will only treat simultaneous 
collisions superficially, since Chapter 7 is devoted to velocity-based complementarity formulations. 

The sequential collision method is based on the idea of treating multiple concurrent single-point colli¬ 
sions as a series of single-point collisions. One can just continue to iterate over the single-point collisions 
and resolve them one by one using a suitable collision law until none of them is colliding [ Bara IT, 1989]. A 
simple sequential collision resolving algorithm is shown in Figure 6.13. The sequential collisions method 
suffers from some difficulties: 

• Different simulation results can occur if collisions are treated in a different order 

• Infinite calculations 

Recent research on sequential collision resolving [Chatterjee et al., 1998, Mosterman, 2001] is concerned 
with the order in which the collisions arc resolved and how to avoid infinite calculations. In Section 6.2.4, 
we present our own solution to the problem. In [Mosterman, 2001] it is shown that there is a relation¬ 
ship between the values of coefficient of restitution e and whether simultaneous or sequential collision 
resolving produces the physical expected results. 

Many consider the simultaneous collision resolving to be more physical, but there is no justification 
for it, except perhaps that it mimics the way a physicist would solve the problem: determine the degrees of 
freedom, analyze forces from a free body diagram, set up equations of motion, and solve for the motion. 

However, in comparison with the sequential collision resolving, the simultaneous collision resolving 
always produces the same global solution to a given problem. Whether the solution results in the physical 
expected motion is another question. The sequential collision resolving, on the other hand, solves the 
problem through a series of local solutions, thus there is a risk that it will never converge a global solution 
of the system. The sequential collision resolving is often beneficial over the simultaneous approach in 
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Figure 6.14: Two balls on table, causing sequential collision solver to enter an infinite loop. 


terms of computational expense. It is also capable of modeling propagation of forces/impulses through 
the mechanical system; the simultaneous approach totally lacks this capability. 

We’ll study the problems of sequential collision resolving through some simple examples. 

6.2.1 Nontermination of Sequential Collision Resolving 

A small ball is resting upon a fixed plane, the coefficient of restitution between the small ball and the 
plane is zero. A large ball is resting on top of the small ball; the density of the larger ball is 1,000 times 
larger than the small ball. The coefficient of restitution between the small and large ball is zero. The 
large ball is initially given a downward velocity of v. There is no friction in the configuration. The 
example configuration is illustrated in Figure 6.14. The example is completely one-dimensional, and the 
balls interact through central frictionless collisions, which mean that we can safely ignore any rotational 
motion. We have two contact points, the first contact point is between the table and the small ball, the 
second contact point is between the two balls. 
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Figure 6.15: Billiard balls wrapped by a frame. Initially ball E is moving to the right with speed v. 


The first colliding contact point is the second contact point between the two balls. Since we have a 
completely inelastic collision, from conservation of momentum we have 


lOOOn = 1000n a fter + Rafter, (6.115a) 

Rafter = (6.115b) 

where we have used v = ||r||, after having resolved the collision both balls move downward with the 
same speed of This means that the next colliding contact point is the first contact point. 

The plane has infinite mass, which means that after the collision resolving the small ball and the plane 
move downward with zero velocity. The next colliding contact point is now the second contact point. As 
before, from conservation of momentum we get 


1000 


1000 

1001 


V — 1000n a fter + "Wafterj 


Rafter 



(6.116a) 

(6.116b) 


Now the two balls move downward with a speed of (TTjTTT) 2 • The P atl:em repeats itself, and it is easily seen 
that after n passes over the two contact points the speed of the large ball will have decreased to (T7TTT) ” ■ 

From the example, it is also clear that the speed of the larger ball will never become zero, thus we will 
loop infinitely over the two contact points. Of course, in a computer simulation, we will at some point 
after a lot of computation time, encounter an underflow error and the computations will blow up in our 
face. 

As another example, imagine a series of small billiard balls, with a frame wrapped around them. The 
balls all have the same mass and the frame has the same mass as a single billiard ball. Furthermore, the 
coefficient of restitution is one between all objects and there is no friction. The configuration is shown 
in Figure 6.15. Initially, the rightmost ball E is given a speed v to the right. The ball will collide with 
the frame in a completely elastic collision, thus after the collision, the ball will have zero speed and the 
frame will move to the right with speed v. Now the frame is colliding with ball A in a completely elastic 
collision, meaning that afterward the frame will have zero speed and ball A will move to the right with 
speed v, ball A will now collide with ball B, and so on until ball D collides with ball E. We arc now back 
where we started. The pattern can thus repeat itself forever without ever changing. 
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Figure 6.16: Box falling onto plane, making completely elastic impacts at the four contact points a, b, c, 
and d. 

If the experiment is performed in the real world, one sees that the 1 inear momentum will distribute 
itself equally over all objects, implying that they will all move to the right with the speed of ^v. This is 
actually the solution a simultaneous method will compute. 

6.2.2 Indeterminacy of Sequential Collision Resolving 

Another issue with sequential collision resolving is that different results can be obtained depending on the 
order in which collisions arc resolved. 

As an example, imagine a box falling with one face flat against a fixed plane. For simplicity we will 
assume that there is no friction and that the coefficient of restitution is one between the box and the plane. 
Therefore, we expect the box to bounce back upward again without changing its orientation. In fact, if 
the simulation is run for long enough time, we expect to see the box jumping up and down on the plane 
without every changing its orientation. The configuration is shown in Figure 6.16. Upon impact we will 
have four contact points. We’ll label these a, b, c, and d. When the box touches the plane all contact points 
will collide; therefore we have to choose one contact point. Let’s pick a. Applying a normal impulse at a 
will make the box rotate with its axis going in the direction of b and d. 
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Figure 6.17: Three balls colliding in a completely symmetrical fashion. 


By symmetry, two things can happen next, either we choose the contact point c as the next colliding 
contact point to treat or we choice one of the b and d contact points. Picking the c contact point will correct 
some of the rotational motion that was caused by the normal impulse at a. It is very likely that none of 
the contact points will be colliding after having picked c, and the box will bounce upward almost with 
unaffected rotational motion. On the other hand, if we choose contact point b, the rotational motion will 
become even worse, and we may end up with a rotating box bouncing back from the plane. The point to 
be made here is that different things can happen dependent on the order we choose to resolve the contact 
points a, b, d, and e in. 

Heuristics for picking a deterministic and correct order arc available [Chatterjee et al., 1998]. These 
heuristics require that one examines the colliding contact points and sorts them according to some physical 
measure. This causes the sequential collision resolving to be more computationally expensive. 

However, it is possible to come up with configurations that exhibit special geometries, where no 
heuristic can be applied to resolve the problem of picking an order. Imagine, for instance, a ball at rest 
with a identical ball on its left side moving to the right and another identical ball on the right side moving 
to the left. The two balls on the sides arc mirror images of each other implying they will hit the center 
ball at the exact same time. At the time of impact we will have two completely identical contact points. 
There is no preference for choosing the left contact point over the right or vice versa. The configuration is 
shown in Figure 6.17. Another commonly encountered case exhibiting special symmetry is a billiard ball 
hitting a comer. At the time of impact, the ball touches the corner walls at two different contact points, 
but there is no way of favoring which contact point to resolve first. 

Notice that in both of the examples of special symmetry, picking one order over the other would lead 
to different simulation results. It is important to understand that the cause of this indeterminacy is not 
due to our lack of ability to find a good heuristic for picking an order. It is a basic physical problem. 
In [Chatterjee et al., 1998] it is suggested that a random ordering is the best way to deal with special 
symmetries. 

6.2.3 Simple Simultaneous Collision Resolving 

As an example of a simultaneous collision resolver we will derive the method proposed in [Baraff, 1989]. 
It is a simple method in the sense that it only considers normal impulses and normal contact velocities, 
and there is no friction. We refer the reader to Chapter 7 for more advanced modeling. 

If i is the index of a contact point then the collision impulse J, at that contact is 

T. — ji Hi ■ (6.117) 
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Figure 6.18: Simultaneous Contact. Showing how object B resolves collision between object A, and 
objects C and D. 


where n, is the contact normal of the fth contact point. Since impulses arc required to be repulsive, we 
have 

ji>0. (6.118) 

We restate Newton’s impact law as described in Section 6.1.1 for multiple contact points, let uq be the 
initial relative contact velocity in the normal direction of the fth contact point, and similarly Ui f is the 
final relative normal contact velocity, then 

u lf > — eu q, (6.119a) 

xii f +eui i > 0. (6.119b) 

The > is needed because more than one object can be pushing on a single object. To make sure we do not 
get too much impulse we will require 

u if > -eui t => ji = 0. (6.120) 

To gain some intuition, it can be helpful to study a small example, as shown in Figure 6.18. Here object 
B pushes object A away from object C and object D, so there is no need for impulses between objects A 
and C and objects A and D. We can rewrite (6.120) into a complementary condition, 

ji (u if + eu h ) = 0. (6.121) 

Knowing that u- lf is a linear function of all the impulses, j = [jo • • • ji ■ • ■ ] T , that is, Ui f ( j ) to be shown 
shortly, we can write a linear complementarity problem , for all i we require 


ji> 0, (6.122a) 

Ui f (j) + euit > 0, (6.122b) 

ji ( u if {j ) + eu^) = 0. (6.122c) 

Now we will look at how much the fth impulse changes the contact velocity at the j’th contact point on 
X. The situation is depicted in Figure 6.19. The change of contact velocity at the j’th contact point w.r.t. 




Figure 6.19: Multiple contact points. The change in relative contact velocity at the j’th contact point 
depends on impulses at both j’th and fth contact points. 

to object X due to the impulse at the i’th contact is 

Am/ = Av x + Au t x x r x , (6.123) 

where Av x = ±^-j, and Au x = +// (r x x n 7 j ji, where the sign depends on the direction of the 
contact normal. Substituting for Av x and A uj x and assuming X plays the role of B at the fth contact, 
we have 

Auji = + ( T x ] i r i X x ~ n i) ji) x r j A > (6.124) 

cleaning up 

Au jf = (j^ + (lx i r i X X »»*)) X r /) ji- (6- 125 ) 

That is, 

A u x = s-fji, (6.126) 

*£ = + ( J x ( r i X X - n 0) X r/) . 


where 


(6.127) 
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If X played the role of A then we should have used n, instead of —n,. To find the total change of the 
contact velocity at the j’th contact point we must sum over all the contact points in contact with X, 

A u* = A ujq 4-1- A ujf H-+ Arif*, (6.128) 

with our new notation 

A U* = Sjojo 4-f Sjfji 4-F Sjn./n- (6.129) 

Now we must also find the change of contact velocity w.r.t. object Y, similar derivation will lead to 

AWj 1 = Sjo'jo + • • • 4- s ? ji + ■ ■ ■ + s j* n jn- (6.130) 

Now assume X played the role of A at the j’th contact point, then Y must be B at the j’th contact point, 
so the change in relative contact velocity is 

A Uj X - Art/ = (3j$ - s/ 0 )j 0 +-F (sjf - s/)j* 4- • • • + (s * - s^ n )j n . (6.131) 

We arc only interested in the normal direction, so we can simplify our derivation by taking the dot product 
with the contact normal, that is 

Uj f = -©j'ojo 4- • • • 4- Ajiji + ■ ■ ■ 4- Aj n j n 4- Uj i , (6.132) 

where 

A A = ( s fi ~ s )i) ' n r (6.133) 

Writing up the linear system for all contact points results in 

Aj 4- b > 0 compl. j > 0. (6.134) 

Figure 6.20 illustrates pseudocode for setting up the A-matrix and the b-vector. 

6.2.4 Robust and Stable Sequential Collision Resolving 

In this section we give the technical details for an efficient stable and robust implementation of a sequential 
collision resolver. It overcomes some of the problems with infinite looping at the price of physical realism. 
The method still yields plausible results. The method is based on the ideas in [Chatterjee et al., 1998]. 

The method described will need a collision law, which has a coefficient of normal restitution, e. Other¬ 
wise any collision law can be used. During initialization all single point collisions, i.e., the contact points, 
arc inserted into a heap data structure S. The contact points arc sorted based on their relative normal 
contact velocity, u n , computed as in (6.14). If u n < 0, then the objects arc colliding at the contact point 
and we need to apply an impulse J to the objects. 

As suggested in [Chatterjee et al., 1998], the contact point with the smallest relative normal contact 
velocity is obtained from the heap S. This contact point is called the minimum contact point. Getting the 
minimum contact point is a constant time operation, as the minimum element in a heap often is the first 
element. If the minimum contact point is colliding, an impulse is applied to resolve the collision. If on 
the other hand, the minimum contact point is noncolliding, then by the heap property of S, there cannot 
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A = new (n x n)-matrix 
b = new n-vector 
C = set of all contacts 
for each j in C 
b[j] = (1 + e)u ji 
for each i in C 

Atj] [1] = 0 

if one of the bodies of i is body A of j 

let X be the body of i that is body A of 

A [ j ] I i ] +=sfi -rij 

if one of the bodies of i is body B of j 

let Y be the body of i that is body B of 

A [ j ] [ i ] -= sji ■ rij 


j 


j 


Figure 6.20: Pseudocode for setting up linear complementarity problem for simple simultaneous collision 
resolving. 


be any other contact points with a relative normal contact velocity less than the minimum contact point. 
Therefore, there cannot be any more colliding contact points and the algorithm can terminate. If a collision 
impulse is applied, then the relative normal contact velocity is recomputed, for all contact points having at 
least one object in common with the minimum contact point. These contact points arc said to be affected 
by or dependent on the minimum contact point, since their relative contact velocity depends on the two 
incident objects at the minimum contact point. After having updated all the dependent contact points, 
the heap S needs to be updated, in order for the heap property to be fulfilled. Hereafter, the algorithm 
obtains the minimum contact point from the heap S, and repeats the above steps until termination. A 
pseudocode version of the algorithm is shown in Figure 6.21. If an implementation is made precisely as 
the pseudocode outlines, it will quickly cause trouble in practice. A test like u n > 0 is difficult to apply 
due to precision errors. One solution to the problem is to use a threshold value, e > 0, such that the test 
becomes 

u n > —e. (6.135) 

The threshold value e should be a small positive number indicating the accuracy of the floating-point 
operations. In practice, the test works excellently, especially in combination with a collision envelope, 
which it magnitudes greater than the precision. 

The problem with the pseudocode algorithm is due to the physics. As explained in Section 6.2.1 there 
arc configurations causing an infinite sequence of collisions. The important thing to notice is that if e < 1, 
then the relative contact velocities arc always monotonically decreasing. In [Chatterjee et al., 1998] it is 
therefore suggested that when infinite loops happen, a truncation or extrapolation can be used to obtain an 
answer. 

However, it’s not easy to determine when an infinite sequence of collisions occurs. An ad hoc solution 
to the problem is to test whether the current value of the relative normal contact velocity has decreased in 
absolute value to some specified fraction of the initial smallest relative normal contact velocity. A fraction 
of 1/1000 usually works well in practice. A smaller fraction means that the collision resolving will take 
more computation time, and a larger fraction means less correct simulation results. Notice that the test for 
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Algorithm sequentiall (C :Contacts) 

Heap S 

for each c £ C do 
compute u n for c 
put ( u„,c) into S 
next c 

while forever 
(u n , c) = min(yS) 
if u n > 0 then 
return 
end if 

compute J for c 
apply J to objects inc 
for Md € C, d sharing object with c do 
compute u n for d 
update heap with (u n ,d) 
next d 
end while 
End algorithm 


Figure 6.21: Sequential collision resolving based on minimum relative normal contact velocity. 


an infinite loop is based on a global threshold value and a local test for each contact point. The local test 
is performed each time the minimum contact point is obtained from the heap S. 

However, the infinite loop test will only detect infinite loops in configurations where truncation could 
resolve the problems, such as the two balls on a table configuration, as shown in Figure 6.14. Configu¬ 
rations such as the framed billiard balls in Figure 6.15 will not be detected. To circumvent this problem, 
a simple counter test is introduced; each contact point keeps track of the number of times it has been 
resolved. When the minimum contact point is obtained from the heap, its counter is incremented by one, 
afterward it is tested if the counter exceeds a user-specified maximum count. Care must be taken not to set 
the maximum count too high or else an excessive amount of computation time can be wasted. If set too 
low the collision resolver may detect perfectly solvable configurations as containing infinite sequences 
of colliding contact points. Usually values in the range of 20-100 works well for computer animation 
purposes. Initially, the resolving counter is set to zero for all contact points. 

Figure 6.22 illustrates the modifications to the pseudocode in Figure 6.21. When an infinite sequence 
of colliding contact points is detected, an evasive action has to be performed or else there is a risk for a 
never-ending computation. If e < 1, then it seems sensible that kinetic contact energy will continue to 
dissipate a little for each an every collision that is resolved, until there is no more kinetic energy left. The 
limit at which there is no more kinetic energy corresponds to the relative normal contact velocity being 
zero, that is, u n = 0. We know from theory that setting the coefficient of restitution to zero, e = 0, 
will make u n = 0 when resolving one contact point. Therefore, an evasive action is simply to set e = 0 
when computing the impulse for a contact point, which has passed the infinite loop test. Setting e is called 
truncation and the computed impulse is called a truncated impulse. 

Theoretically, setting e = 0 should give u n = 0. In practice however, numerical precision often causes 
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Algorithm sequential2 (C :Contacts) 
Heap S 

for each c £ C do 
compute u n for c 
counter(c) = 0 
put (u n ,c) into S 
next c 

(u n ,c) = minis) 

^fraction — ( I / I 000) Un 
A/nax = 20 
while forever 
(u n , c) = min(S) 
ifu n >0 then 
return 
endif 

counter(c) = counter(c) + 1 

if Un A C fraction Or counter(c) > A r max then 
...handle infnitite loop... 
else 

...compute impulse as usual.. . 
end if 

end while 
End algorithm 


Figure 6.22: Modification of sequential collision resolver to detect infinite sequences of colliding contacts. 


u n to be slightly nonzero. During the update of contact points in the heap, it is better simply to set u n = 0 
when updating a contact point that has just been resolved with a truncated impulse. A Boolean flag is 
needed on each contact point, so a contact point can be marked as having been resolved by a truncated 
impulse or not. 

If instead, u n is recomputed by (6.14) when a contact point has been truncated it might occur that the 
value is slightly lower than the precision threshold limit. This will cause the algorithm to do several series 
of truncations on the same contact point before the single-point collision has been resolved. The result is 
more iterations and slower simulation. 

To counter artifacts from numerical precision it is often also advantageous to truncate the post collision 
velocities of the objects from a contact point where a truncation impulse has been applied. Pseudocode 
for the velocity truncation is shown in Figure 6.23. The final pseudocode version of sequential collision 
resolver algorithm is shown in Figure 6.24. 

The sequential collision resolver algorithm in Figure 6.24 will be able to to detect an infinite loop in 
configurations like the two balls on the table and the framed billiard balls. It should be noted that truncation 
impulses arc not the whole answer to making a bulletproof evasive action on an infinite loop. Notice that 
the coefficient of restitution is zero for ah contact points in the two balls on a table configuration, and this 
results in an infinite loop. 

The algorithm will not always return the physical expected result. If detection of infinite loops is 
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algorithm truncate (v, u> ) 
If |d x | < £ Then 
v x = 0 
End If 

If \v y \ < £ Then 
Vy = 0 
End If 

If |u z | < £ Then 
v z = 0 
End If 

If \uj x \ < £ Then 
0J X = 0 
End If 

If \<jJ v \ < £ Then 

IjJy = 0 

End If 

If \ui z \ < £ Then 
U! z = 0 

End If 

end algorithm 


Figure 6.23: Pseudocode for postvelocity truncation to counter numerical precision artifacts. 


over-eager, then the algorithm will have an effect of artificially decreasing the coefficient of restitution. 
Typical large values e > \ returns plausible results, whereas values e < i seem to be clamped to zero, 
and values in between seem artificially damped. These value ranges arc typical for the values listed for the 
fraction-test and maximum resolve count test. Other values for these constants would result in different 
behavior. 

The algorithm is excellent for use in computer games or computer animation, since it is robust and 
stable, and can be made to run very fast if one is willing to sacrifice physical realism. It should be noted 
that a good heap data structure is crucial for good performance. 

It is possible to make variations of the algorithm. For instance, contact points could be sorted in the 
heap data structure based on penetration depth or some other physical or geometric quantity. Also, other 
evasive actions could be taken in case an infinite loop is detected. For instance, one could switch to a 
simultaneous collision resolving method. 

The algorithm can have a serious impact on configurations with e = 1; if an infinite loop is detected it 
will simply set e = 0. In a configuration like the one shown in Figure 6.15, all objects will come to a halt, 
which is not the wanted simulation result. To circumvent this problem one could add an extra test on the 
value of the coefficient of restitution to detect what kind of infinite loop that has been detected. If e < 1 it 
is safe to apply a truncation impulse, if e = 1, another solution method could be applied. 
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Algorithm sequential! (C :Contacts) 

Heap S 

for each c £ C do 
compute u n for c 

counter(c) = 0 
truncated(c) =false 
put ( u„,c) into S 
next c 

( u„,c ) = minis') 

^fraction — { I / I 000) Un 

IVmax = 20 
while forever 
(u„, c) = min(S) 

counter(c) = counter(c) + 1 

if u n >0 then 
return 
endif 

if Un > ^fraction or counter (c) > Af max then 
set e = 0 
compute J for c 
apply J to objects inc 

truncated(c) =true 
else 

...compute impulse as usual.. . 
end if 

for Md € C, c' sharing object withcdo 

if truncated(c') then 
Un — 0 

truncate(i>A, wx) 
truncate(u_e, u > b ) 
truncated(c / ) =false 
else 

compute u n for d 
end if 

update heap with (u n ,d) 
next d 
end while 
End algorithm 


Figure 6.24: Final pseudocode version of sequential collision resolver. 
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6.3 Conservative Advancement 

In the work [Mirtich et al., 1994, Mirtich et al., 1995, Mirtich, 1996] a new simulation paradigm emerged 
building on the early ideas of [Hahn, 1988]. The paradigm was named impulse-based simulation. Later, 
the work was extended to hybrid simulation [Mirtich, 1995], a paradigm combining constraint-based and 
impulse-based simulation. The main focus here is the time-stepping method presented in [Mirtich, 1996]. 

Looking at rigid bodies at a microscopic level, all kinds of contact can be modeled by collision im¬ 
pulses, even something like a cup at rest on a table. This type of contact is called static contact. Static 
contact is modeled as a series of highly-frequent collision impulses occurring between the two objects in 
static contact. The frequency of the collision impulses is so high that it is nearly impossible to distinguish 
the individual collision impulses. The limiting case with infinite frequency will be equivalent to com¬ 
puting the time-integral of the contact forces (6.18) over a finite time-step At. The force time-integral is 
confusingly called impulse, but it is not a collision impulse. 

In practice, we do not have the computational resources to compute an infinite series of collision 
impulses, therefore we use a collision envelope around each object. The collision envelope can be thought 
of as a thick surface of an object. It is a model of the atomic layer of an object that comes into contact 
with the atomic layer of another object. Thus, it is the atoms in these layers that arc bouncing against each 
other during static contact. It is important to realize that objects arc defined as touching when they arc 
within each other’s collision envelopes. An actual distance of zero between the geometries of two objects 
is never seen. 

Frequent collisions occurring during static contact is now modeled by applying an impulse only at 
the closest point between two objects. The closest point will lie inside the collision envelopes of the 
objects. After having applied the collision impulse, the velocities of the objects will have changed, such 
that if we advance the time of the simulation by integrating the free-moving body state function (22.81) 
of the two objects, the objects will move away from each other at the closest point, regardless of how 
small a time-step we integrate forward. This will mean that at some point in the future a new closest 
point within the collision envelopes will be found. Thus, static contact is modeled as an object that 
vibrates on top of another object. If the collision envelope is small enough the vibration will occur at a 
frequency where a human observer will not notice it. Figure 6.25 illustrates the idea using an exaggerated 
collision envelope. In [Mirtich et ah, 1994, Mirtich et ah, 1995, Mirtich, 1996] a time-stepping method 
is presented that computes an estimate for a time-step, that can be used to integrate the rigid body state 
functions describing the free motion of the objects without causing penetration or zero distance of the 
object geometries. The time-step is found by computing a conservative lower bound for the Time of 
Impact ( TOI ) between all potential colliding object pairs. 

A TOI value should be easy and fast to compute, since it is expected to be recomputed many times. In 
fact, it must be recomputed every time two objects collide or whenever the time-stepping has reached the 
time of a TOI. Thus due to performance considerations, it is crucial that a TOI value is computationally 
inexpensive to compute. A TOI value should always be a lower limit to the true time of impact and the 
lower limit should converge toward the true time of impact the closer the objects arc. This is what we mean 
by a conservative TOI. Computing TOI values in this way will ensure that a collision is never missed, and 
that it is safe to integrate forward in time to the minimum TOI value. 

Figure 6.26 illustrates the basic idea behind the TOI time-stepping method. Notice that the stepping 
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(a) First atomic collision 


(b) Second atomic collision 



Closest Colliding Contact ^ 0 - - - - 



B 


(c) Third atomic collision 

Figure 6.25: The collision impulse model of static contact. A box A is resting on top of a fixed box B. 
The static contact is modeled as a series of collisions occurring between the closest point between A and 
B. Thus, box A is in fact vibrating on top of box B. The smaller the collision envelope, the higher the 
vibration frequency, and an end user will not notice the vibration. 
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algorithm run (t, At) 

£final = t + At 
do 

do collision detection 

resolve colliding contact points 

ttoi — min(compute toi's for all object pairs) 

At — HliI!(/final hoi, /final t) 

integrate forward by At 
t = t + At 

while t < ffinal 
end algorithm 


Figure 6.26: Pseudocode for simple TOI time-stepping. 


method is invoked with a wanted time-step size, which indicates the end-time for the next frame of the 
simulation. That is the simulation time for the next frame that should be rendered in an animation. The TOI 
time-stepping method belongs to a group of time-stepping algorithms termed Conservative Advancement 
because time only runs forward, unlike in Retroactive Detection, where time is rewound when a simulation 
goes bad. There arc two major problems with the time-stepping method outlined in Figure 6.26. First, 
how should a TOI value be computed? Second, recomputing TOI values between all pairs of objects takes 
quadratic time, which is computationally intractable in large-scale simulations. 

6.3.1 Time of Impact (TOI) Computation 

Assume that the naiTow-phase collision detection algorithm (see Chapter 13) returns the closest points 
between two rigid bodies. Unfortunately, these two points do not provide enough information themselves 
for estimating the position of an actual future impact between the two objects because the objects may be 
moving away from each other at the closest points, which indicates that some other points on the objects 
arc moving toward each other. This is illustrated in Figure 6.27. However, if we assume that the objects 
arc convex, the closest points can be used to give us some information. If the objects arc convex then any 
future colliding points must at least travel the distance between the closest points before they can collide. 
This is shown in Figure 6.28. If we let the function d A {-) be the distance traveled by the colliding point 
x A as a function of time, and similar d B {-) is the traveling distance for x B , then the distance traveled by 
both the colliding points, d(-), at the true time of impact t c is 

d(t c ) = d A (t c ) + d B (t c ) > ||d||, (6.136) 

where d = p A — p B . If (6.136) is solved for the time t yielding equality, then the solution will be a lower 
bound for the true time of impact t c . As objects come closer, the colliding points will end up being the 
closest points just before the actual impact. Thus, the lower bound becomes a better and better estimate 
for the time of impact the closer the objects come. 

Unfortunately, the functions d A (-) and d B {-) are unknown, because the colliding points x A and x B 
arc not yet known, only the closest points p A and p B arc known. To get around the problem, the functions 
d A (-) and d B (•) arc approximated by two other functions giving a conservative upper limit for the traveling 
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Figure 6.27: A bad case for TOI computation, two closest points arc moving away from each other. 
Flowever, there exist two other points moving toward each other. 



Figure 6.28: The actual colliding points xa and xb must at least travel the distance, d, given by the 
closest points p A and p B before they can collide. 
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distance. Notice that it is important that the traveling distance is an upper bound in order for the time 
estimate to be a lower bound. 

From (6.12) we can write the velocity of any point of the surface of an object X, as 

Ux(t) = uj x (t) xr x + v c x (t), (6.137) 

where r x is the body-fixed vector of the surface point. 

If we assume that the forces acting on an object arc constant during the time-step A/ and that there 
is no torque acting on the object, then the object will move along a ballistic trajectory. This may sound 
prohibitive, but it is the same motion as an object moving freely under gravity, thus, it is a good assumption 
for computer animation purpose. Under this assumption the linear velocity of the center of mass is given 
as 

^em(^) = ^cm(^cur) T 9 [/ — /cur] j (6.138) 

where / cur is the current simulation time, and g is the gravitational acceleration. The speed of any surface 
point on object B in the direction of d can now be found by substituting (6.138) into (6.137) and taking 
the dot product with d, 

u B (t) ■ d = (a? B (t) x r B ) ■ d + v c B (t cm ) ■ d + g ■ d[t - Z CU r] • (6.139) 

The surface velocity of any point on object A is almost identical except that —d should be used. Since the 
actual collision points arc unknown, r A and r B are unknown. Flowever, since an upper bound is wanted, 
any surface point can be used, which maximizes the term (u> x (t) x r x ) • (Ad). 

The surface point fulfilling this requirement must be the point lying farthest away from the center of 
mass. Let the distance of this surface point be given by r A ax , which can easily be precomputed before the 
simulation starts. The motion of the object is assumed to be torque free, therefore, it’s possible to compute 
an upper bound on the magnitude of the angular velocity, ec A ax , derived in (6.149). Combining these two 
facts yields 

( w x r x ) ■ (Ad) < wfLuL- (6.140) 

Using (6.140) in (6.139) gives the upper bound on the surface point velocity, 

U B (t) ■ d < Wma X r m ax + ^cm(/cur) d + g - d[t- / cur ] , (6.141) 


thus integrating (6.141) from / cur to /, yields an upper bound on the distance traveled by any surface point 
on B, that is, 

rt rt 


/ ( u(t ) x d) dr < j ((-u c A (/ cur ) +g[t- t cu r ]) • (Ad) + u X ax r x ax ) dr, (6.142a) 

'J £cur J tcm 

= \d ■ (Ad) [t - / cur ] 2 + ©cm (/cur) ■ (±d) + X r m aX ) [/ “ *cur] ■ (6.142b) 

Introducing the constants, 

Ax = -g • (Ad) , (6.143a) 

B x = v c x (/cm) ■ (Ad) + w A ax r A x , (6.143b) 
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where “+” is used, if X = A, and ” otherwise, the approximation to (6.136) is 

{AA + Ab) [t — /cur] + {Ba + Bb) [/ — /cur] = ||d|| • (6.144) 

This is a second-order polynomial that is easily solved. The TOI value is now given as the smallest positive 
root of (6.144) plus the current simulation time. If no such roots exist then the objects cannot collide and 
the TOI value can be set to infinity. 

Note that if the distance between the two objects ever becomes zero, then d = 0, the trivial solution 
to (6.144) is / = 0, causing the algorithm in Figure 6.26 to enter an infinite loop, since the smallest TOI 
value will always be the same as the current simulation time. 

Computing the value of can be done by noting that the rotational kinetic energy of the object must 
be conserved, since there is no torque acting on the object. The rotational kinetic energy, E, computed in 
the body frame of the object is given by 


E = ^cj(/) T / body cj(/), 


(6.145) 


for convenience the body label X has been omitted. The term I body is the inertia tensor w.r.t. body frame. 
It is thus diagonal and constant. Writing (6.145) element-wise yields 


E= \ \u x (t) Uy(t) LO z (t) 1 

'4 

0 

1 

O 


U x {t) 

0 

ly 
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Uy{t) 

z 

0 

0 

4 


u z (t)_ 


which can be reduced to 

2 E = I X UJ X (ty + IyU)y{t ) 2 + I Z LO Z [t ) 2 , 

this is easily rewritten into the equation of an elliptical level set surface 


(6.146) 


(6.147) 


U x (t ) 2 LOyit ) 2 LO z {t ) 2 

2E ~T~ 2E ~T~ 2 E 

lx ly I z 


(6.148) 


This level set surface defines all possible w(Z), picking the m(t) with the largest magnitude will yield a 
value for cj max . From geometry it is clear that ||u>(/)|| is largest along the major axis of the ellipse, which 
means 


k^max 


2 E 


min {I x ,I y ,I z }' 


(6.149) 


6.3.2 Time of Impact Heap 

The second problem of the algorithm in Figure 6.26 is that it is computationally intractable to compute 
TOI values between all pairs of objects. Fortunately, it turns out that this is not necessary. 

If a heap data structure is used to store object pairs sorted on their TOI value, then the minimum TOI 
value can easily be accessed in constant time. Only the TOI value between the object pairs corresponding 
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to the minimum TOI needs to be recomputed, as well as all other TOIs having at least one object in 
common with the minimum TOI object pair. All other TOI values arc known to be larger than the minimum 
TOI value and the rigid body states of their object pairs arc unaffected by a possible collision taking place 
at the contact point corresponding to the minimum TOI. Thus, their future collisions are unaffected and 
the TOI values are still valid lower bounds. After having updated the TOI values of all affected object 
pairs, the TOI heap needs to be updated in order for the heap property to be fulfilled. 

Initially, it is a bad idea to compute TOI values for all object pairs in order to initialize the heap. 
Instead, only TOI values for objects that arc in close proximity are kept in the heap. If sweeping volumes 
arc used in the broad-phase collision detection algorithm as described in Section 12.5, then the results 
from the broad-phase collision detection algorithm yield object pairs that may collide in the near future. 
Whenever an object pair is reported by the broad-phase collision detection algorithm, which is not already 
in the heap, it is inserted into the heap. 

To keep the storage requirements for the heap from accumulating to quadratic size, it is beneficial to 
remove object pairs from the heap when they arc far from each other. First intuition might suggest simply 
to remove object pairs when they arc no longer reported by the broad-phase collision detection algorithm. 
However, this can have fatal side-effects. Imagine a ball bouncing up and down on a fixed plane. The 
TOI value between the plane and the ball will constantly be added and then removed from the heap, thus 
yielding a substantial overhead in maintaining the heap data structure. A better solution is to approach the 
removal of heap elements in a lazy manner. That is, TOI values arc only removed from the heap if the 
object pair of the minimum TOI is no longer close to each other. 

Figure 6.29 illustrates how the TOI heap works and Figure 6.30 shows a pseudocode version of the 
algorithm. 

In most cases, the TOI heap time-stepping method is extremely fast, especially in cases with many 
fast-moving objects. It is beneficial in computer games such as flight simulators, where there is almost 
no static contact. However, the algorithm will have difficulties with rotating objects that have large aspect 
ratios. For instance, a see-saw configuration can exhibit slow convergence for the TOIs corresponding to 
impact with the ground. This is because ?’ max is very large, and d, is small. 

The same behavior of slow convergence of the TOIs, can be seen when objects arc in static contact. 
Here, d becomes very small causing the computed TOI-values to be very small. Stacking of objects can 
cause a simulator to come to a halt, due to the computations involved in updating the TOI values and 
taking time-steps close to zero. 

Note that the entire algorithm actually only needs to resolve a single-point collision between any two 
objects. There is no need for sequential or simultaneous collision resolving. 


6.4 Fixed Stepping with Separation of Collision and Contacts 

Besides the convergence problems of the time-stepping approach described in Section 6.3, it has other 
serious flaws. It turns out that these flaws stem from the basic nature of doing time-integrating of the free 
motion of the rigid bodies and then resolving collisions. Any time-stepping method taking this approach 
to impulse-based simulation will thus suffer from the same flaws. In [Guendelman et ah, 2003], the basic 
nature and cause of these flaws was explained and a new time-integration method was proposed as a 
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Bodies 



Figure 6.29: Conceptual drawing of how the TOI heap works to control the time-stepping. 


solution. This new time-stepping method is the subject of this section. We will discuss the problems of 
implementing an efficient and robust simulator based on this time-stepping method. 

In a traditional time-stepping method , the equations of motion (22.81) arc integrated. That is, the 
first-order coupled differential equations 


d 

—r, 

dt 

d 

— v, 


— Wm 

F 

m 


d 1 r n I 

— 9 = - [ o,^ 9> 


dt 


~ Ec.m — T, 


(6.150a) 

(6.150b) 


arc numerically integrated. Here, r cm is the position of the center of mass, v cm is the linear velocity of the 
center of mass, and F is the total force acting on the center of mass, L cm = I to is the angular momentum 
w.r.t. center of mass, I is the inertia tensor, and ui is the angular velocity. r cm is the torque, q is the 
orientation represented as a quaternion after the numerical integration collision resolving is performed. 
In Figure 6.31 an example simulation of a box on an inclined plane using the traditional time-stepping 
strategy is shown. In the example, the coefficient of restitution is one, and the coefficient of friction is 
large enough to prevent the box from sliding down the plane. Initially the box is at rest, however, after 
having performed the first integration the box will have gained linear velocity, due to gravity acting on 
the box. During collision resolving a collision impulse is applied to the box and the collision impulse 
will change the linear velocity of the box, so it no longer will be colliding with the plane. However, since 
the coefficient of restitution is one, the linear velocity after collision resolving is pointing in the normal 
direction of the plane. This means when integration is performed again, the box will fly up in the air. 
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algorithm toi-heap-time-stepping() 
while TOI (b,b') at top of heap=f C ur 
narrow-collision (6, b') 

contacts = contact-determination(6, b 1 ) 
collision-resolve(contacts) 
recompute all TOI's where 6 is part of 
recompute all TOI's where b’ is part of 
if b and b' no longer close 
remove TOI (6, b') from heap 
end if 
end while 

^next =TOI at top of heap 

At — /-next /-cur 

overlaps = broad-phase (At) 
for each overlap (b,b') in overlaps 
if not TOI (b,b') exist 
compute TOI (ft,//) 
insert TOI (6, b') in heap 
end if 

next overlap (b,b') 

tnexl = TOI at top of heap 

integrate forward by /next /cui 
end algorithm 


Figure 6.30: Pseudocode illustrating how to use the TOI heap for time-stepping. The pseudocode replaces 
the body of the while-loop in Figure 6.26. 


gravity will then work on the box, and eventually the box will fall down on the plane again, but it will hit 
the plane at a lower position. Thus, the box seems to bounce downward on the plane. The same artifact 
causes objects resting on the ground to vibrate, making it impossible to create high stacks of objects. 

The novelty of the work in [Guendelman et al., 2003] is to split the numerical integration of (6.150) 
into two separate phases causing a separation of the collision resolving from the contact handling to give 
the following simulation loop: 

1. Collision resolving 

2. Advance the velocities using (6.150b) 

3. Contact handling 

4. Advance the positions using (6.150a) 

The step given by (6.150b) is termed the velocity update and the step by (6.150a) the position update. 
They arc both handled by a single explicit Euler step. 

In [Guendelman et al., 2003] a different approach is taken to collision resolving: a fixed number of 
iterations arc performed over all the contact points. Thus, after having performed the collision resolving, 
there might still be colliding contacts. 
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(c) Collision resolving in first time-step (d) Result for first time-step 




(e) Integration for next time-step (f) Several time-steps later 

Figure 6.31: Box at rest on inclined plane, coefficient of restitution is one, friction is large enough to keep 
box from sliding. However, due to the nature of the time-stepping method, box starts to bounce down 
inclined plane. 
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algorithm time-step (At) 
for 5 iterations 

collision resolving 
next iteration 
velocity-update 
tmp = epsilon 
epsilon = -1 
for 10 iterations 
epsilon += 1/10 
contact-handling 
next iteration 
shock-propagation 
epsilon = tmp 
position-update 
din — tcur T At 
end algorithm 


Figure 6.32: Pseudocode of the separated collision and contact time integration method. In 
[Guendelman et al., 2003] 5 collision resolving iterations and 10 contact handling iterations arc used. 


Similarly, when doing the contact handling, a fixed number of iterations is used. Contact handling is 
different from collision resolving in several aspects. For instance, the coefficient of restitution is set to 1 
and incrementally raised so that in the last iteration it has value zero. This has the effect of slowing down 
objects before they arc stopped. Also, more iterations arc used than in the collision resolving. 

Shock propagation is done immediately after the last contact handling iteration. Shock propagation 
also uses a restitution coefficient of zero. Furthermore, objects in a stack arc cleverly fixated and unfixated 
while iterating over the contact points. Together, this has the effect of propagating a shock through a stack 
of objects, correcting faulty behavior. Note that contact handling is needed before shock propagation, 
otherwise objects will not feel the weight of each other. 

As with the collision resolving, colliding contact points arc not guaranteed to be completely resolved 
after the last contact handling and shock propagation iterations, even though the coefficient of restitution 
is zero in these cases. This should be clear from the two balls on the plane example in Section 6.2.1. 

Figure 6.32 shows a pseudocode version of the new time-stepping method. 

The predicted positions of objects arc used for the collision detection queries. This means that before 
each iteration of the collision resolver, contact handler, and the shock propagation, the predicted position 
is computed before doing a collision detection query. The collision detection query will therefore return 
future expected colliding contact points. These contact points arc used to change the current velocities 
such that penetration is avoided. However, having altered the velocities, one must update the predicted 
positions in the following iteration. The predicted positions arc computed taking a single explicit Euler 
step on (6.150a). 

Figure 6.33 shows pseudocode of the collision resolving. Contact handling is similar to collision 
resolving. Shock propagation is similar to the contact handling, however there arc some significant dif¬ 
ferences. A contact graph is built and contact points arc processed in an order corresponding to their 
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algorithm collision-resolve 

move all bodies to predicted locations 
collision detection 

order contact points by penetration depth 
for each contact point apply collision law 
end algorithm 


Figure 6.33: Collision resolving consists of a single iteration over all contacts. 


algorithm shock-propagation-handling 

move all bodies to predicted locations 
collision detection 
compute contact graph 

for each stack layer bottom up order 

order contact points in layer by penetration depth 
for each contact point apply collision law 
next layer 
end algorithm 


Figure 6.34: Shock propagation. This is similar to collision resolving except that contacts arc processed 
in a bottom-to-top stack layer fashion. 


placement in the contact graph. The contact graph is used to analyze if objects arc stacked on top of each 
other. Afterward, contact points arc organized into disjoint sets representing the stack layers in a stack. 
These layers arc processed in a bottom-to-top fashion using a coefficient of restitution equal to zero, and 
setting lower bodies to be fixed. That is, they have infinite mass for this computation. Figure 6.34 shows 
a pseudocode version of shock propagation. 

Taking large fixed time-steps is an advantage from a computational viewpoint, and thus very attractive 
for large-scale simulations. However, if time-steps arc too large, tunneling or overshooting can easily 
occur. To reduce the chance of this simulation artifact, the time-step can be limited; for instance, by 
making sure that the fastest moving object does not move further than some specified fraction of the 
smallest bounding box width of the smallest object. This can definitely be incorrect, but seems to work 
well in practice. 

In [Guendelman et al., 2003] after having applied an impulse to a contact point, the collision results arc 
reevaluated between the object pairs for that contact point. That is, the object pairs’ predicted positions arc 
updated and the narrow-phase collision detection query is repeated. This is done in the hope that resolving 
a colliding contact point may change the predicted positions, such that the object pair will no longer be 
colliding. The method outlined here corresponds to the optimized method in [Guendelman et al., 2003]. 

There arc a few publicly available implementations of the method using an even more optimized 
approach, in the sense that fewer collision detection queries arc performed. For instance, one can reduce 
the number of iterations to just one for both the collision resolving and contact handling. Furthermore, a 
single collision detection query could be run and the same contact points can be reused for the collision 
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(a) Shock propagation results in Figure 6.35(b) (b) Shock propagation results in Figure 6.35(a) 

Figure 6.35: Errors cannot be corrected by shock propagation if there arc cyclic dependencies. 

resolving, the contact handling, and the shock propagation. This, of course, will increase the chance of 
artifacts such as penetrations, tunneling, and overshooting. On the other hand, it will yield a speed-up of a 
factor of 10-16 depending on the number of iterations and the kind of collision detection algorithms one 
chooses to use. 

Finally, the purpose of shock propagation is to fix simulation errors. Therefore it can be replaced 
by other error-correcting methods , such as the projection error-correction explained in Section 7.10 and 
Section 7.14. These have the advantage of being able to completely fix penetration errors. Note that the 
propagation that occurs from the bottommost layer of a stack to the top, has trouble fixing errors for cyclic 
configurations as illustrated in Figure 6.35. 

6.4.1 Computing Stack Layers 

Contact graphs arc easily computed [Erleben, 2005], a contact group is a subset of objects in the configu¬ 
ration, all in mutual contact with each other. Edges arc created between objects if they arc in contact and 
contact points arc stored directly in these edges. Thus, we want to analyze a contact group for its stack 
structure and if possible, compute stack layers of the contact group. 

A stack is defined as a set of objects being supported by one or more fixed objects. A cup on top of 
a table is a stack. The table is the fixed body and the cup is supported by the table. Objects in a stack 
can be assigned a number indicating how far away they arc from the fixed object supporting them. This 
number is an indication of the height of the object in the stack. Thus, all fixed objects in a configuration 
have a stack height of zero. Nonhxed objects in direct contact with the fixed objects have a stack height 
of one. Nonhxed objects in direct contact with objects with stack height one, but not in contact with any 
fixed objects, have a stack height of two. A simple example is shown in Figure 6.36. This definition of 
stack height does not give a unique sense of an up-and-down direction as is commonly known from the 
real world. This is illustrated in Figure 6.37. Notice the position of the object with the largest stack height. 
An object is said to be closer to the bottom of the stack compared to another object if the stack height of 
the object is lower than the other object. Similarly, the bottommost objects arc those having the lowest 





Figure 6.36: Simple stacked objects annotated with stack height. 



Figure 6.37: Nonsimple stacked objects annotated with stack height. Free-floating objects arc marked 
with a question mark. 

stack height, that is the fixed objects. 

A free-floating object is special, since it is not in contact with any other objects; however, one may 
even have an entire group of bodies, all in mutual contact with each other, but none in contact with a fixed 
object. In these cases, it does not make sense to talk about assigning a stack height to the objects. Instead, 
the convention can be used to assign these kinds of objects an infinite stack height to distinguish them 
from objects that arc paid of a stack. A negative value could also be used, but is not an efficient choice for 
the algorithm presented in this section. 

The stack height of objects is easily computed by doing a breadth-first traversal on each contact group. 
Initially, the stack height of all objects is set to infinity unless they arc fixed objects, in which case their 
stack height is set to zero. Also, all fixed objects arc pushed onto a queue. This queue is used by the 
breadth-first traversal. The initialization steps arc shown in Figure 6.38. 

After the initialization, the breadth-first traversal will pop an object. A, from the queue and iterate 
over all incident contact graph edges to object A. For each edge it is tested to determine if the object, B, 
at the other end of the edge has been visited by the traversal before. If not, this object is pushed onto the 
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Queue Q 

for each body in Group do 
if body is fixed then 
height(body)=0 
Q.push(body) 
visit(body) = true 
else 

height(body) = infinity 
visit(body) = false 
next body 


Figure 6.38: Initialization of stack analysis algorithm. All bodies in a contact group arc traversed; fixed 
bodies arc identified and added to a queue for further processing. 


queue. The height hs of the object B is also computed as 

hs min(/i/ 3 , h,A + 1)- (6.151) 

That is, either a shorter path to object B is already known, in which case h b is left unchanged, or it is 
shorter to get to B, by going from A. The cost of taking this paid is one more than the cost of getting to 
object A. 

During the traversal a stack layer index is computed for the edges as they arc being visited. A stack 
layer is defined by two succeeding stack heights, such as 0 and 1. These two stack heights define a subset 
of objects in the contact group. That is, stack layer 0 is defined as all objects with stack height 0 and stack 
height 1, and all edges between these objects. Stack layer 1 is defined by all objects with stack height 1 
and stack height 2, and all edges between these objects, and so on. This means that an edge between an 
object with height i and another object with height i + 1 is given stack layer index i. 

Note that there is some subtlety with edges between objects with the same object height. As an 
example, for stack layer i, if we have an edge between two objects both with stack height i + 1, then the 
edge belongs to stack layer i. Ideally an edge between two objects with height i should also be added to 
stack layer i. However, this is not done because the stack layers arc processed in a bottom-up fashion, 
thus contact points belonging to the edge between objects at height i were taken care of when stack layer 
i — 1 was processed. 

Figure 6.39 shows pseudocode for assigning stack heights to objects and stack layer indices to edges. 

After having assigned stack heights to objects and stack layer indices to edges, it is a simple matter to 
traverse these and assign them to the respective layers they belong to. 

Objects arc a little special. Given an object, A at stack height i, one must traverse the edges and 
examine the stack heights of the objects at the other end. If an object B with stack height z — 1 is found, 
then object A is safely added to stack layer i — 1; if an object C is found with stack height i + 1, then 
object A is added to stack layer i. Object A can only belong to stack layer i — 1 and i; this means as soon 
as two other objects have been found indicating that object A should be in these two stack layers, one can 
stop traversing the remaining incident edges of object A. 

Figure 6.40 shows the pseudocode for building the stack layers. 
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List edges 

timestamp = timestamp + 1 
height = 0; 
while Q not empty 
A = pop(Q) 

for each edge on A do 

B = body on edge that is not A 
if not visit(B) then 
Q.push(B) 
visit(B) = true 
end if 

height(B) = min( height(B), height(A) + 1 ) 
if height(B) = height(A) and height(B) not 0 then 
layer(edge) = height(B) -1 
else 

layer(edge) = min( height(B), height(A) ) 

height = max(height,layer(edge)) 
if not timestamp(edge) = timestamp then 
timestamp(edge) = timestamp 
edges.push(edge) 
end if 
next edge 
end while 


Figure 6.39: A breadth-first traversal is performed, assigning a stack height to each body, equal to the 
number of edges on the minimum path to any fixed body. Edges of the contact group arc collected into a 
list for further processing. 



“book” — 2005/9/30 — 15:44 — page 182 — #194 


© 


a 


182 


CHAPTER 6. IMPULSE-BASED MULTIBODY ANIMATION 


Group layers[height +1] 
for each edge in edges do 
idx = layer(edge) 
add contacts(edge) to layers(idx) 
next edge 

for each body A in Group do 
if height(A)=infinity then 
continue 
end if 

in_lower = false 
in_upper = false 
for each edge on A do 
B = other body on edge 
if height(B) > height(A) then 
in_upper = true 
end if 

if height(B) < height(A) then 
in_lower = true 
end if 

if in_upper and in_lower then 
break 
next edge 
if in_upper then 

layers[height(A)].push(body) 
end if 

if in_lower then 

layers[height(A) - 1].push(body) 
end if 
next body 
return layers 


Figure 6.40: Building stack layers by processing all edges and bodies examining their stack height and 
layer indices. 
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Constraint-based simulation is usually grouped in four formulations: force-based, velocity-based, kinetic 
energy-based, and motion-space based. This chapter will discuss the force- and velocity-based meth¬ 
ods. We refer the reader to [Milenkovic et al., 2001, Schmidl, 2002] for the kinetic energy-method and to 
[Redon et al., 2003] for the motion-space method. 

In force-based formulations, the exact contact force at a given time is found and then used in an 
ordinary differential equation describing the motion of the bodies in the scene. In a sense, a force-based 
formulation sees the instantaneous picture of the configuration, while a velocity-based formulation on the 
other hand, sees the effect of the dynamics over an entire time interval. Imagine that the true physical 
contact force, f Uuc (t), is somehow known. The impulse J in the time interval At is then given as 

r-At. 

J = f true (f)dt. (7.1) 

JO 

and with Newton’s second law of motion one can solve for the velocity, v At, as follows 

f At dv f At 

J m ~dt dt = J -ftraeW ( 7 - 2a ) 

m (vAt -v 0 ) = J (7.2b) 

A new position can now be found by integrating the velocity. The “force,” f, which we try to solve for in 
a velocity-based formulation can be interpreted as 

J = A tf, (7.3) 

which numerically will produce the same movement as if we had known the true contact force and com¬ 
puted the time integral. Since velocity-based formulations solve for impulses, they are also called impulse- 
based formulations, not to be mistaken with impulse-based simulation, which is an entirely different sim¬ 
ulation paradigm, as discussed in Chapter 6. 

A force-based formulation on the other hand, tries to compute the force, / true (Z), then it will use the 
force to solve for the acceleration of the motion, which is then integrated once to yield velocities and twice 
to yield a new position. 

Force-based formulations cannot handle collisions, and we must switch to an impulse-momentum law 
at the point of collision [Baraff, 1989, Anitescu et al., 1996, Pfeiffer et al., 1996b, Chatterjee et al., 1998]. 
Force-based formulations also suffer from indeterminacy and inconsistency [Baraff, 1991, Stewart, 2000]. 
The velocity-based formulation suffers from none of these drawbacks. Another advantage of the impulse- 
based formulation is that it does not suffer from the small time-step problem in the same extent as the 
force-based formulation, meaning that larger time-steps can be taken during the simulation. The small 
time-step problem is described by Milenkovic and Schmild [Milenkovic et al., 2001, Schmidl, 2002]. 
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Velocity-based formulations in constraint-based methods arc widely popular - and used, e.g.. Open Dy¬ 
namics Engine [ODE, 2005], Karma from MathEngine [Karma, 2005], and Vortex from Critical Mass 
Labs [Vortex, 2005], In the following, we will present the classical velocity-based constraint formu¬ 
lation [Stewart et al., 1996, Stewart et al., 2000], give a possible object-oriented implementation design, 
and discuss various practical issues. Furthermore, we will discuss work presented [Anitescu et al., 1997, 
Sauer et al„ 1998, ODE, 2005], 

Many papers and books written on velocity-based formulations use a rather high and abstract level of 
mathematical notation together with a great amount of “long forgotten” analytical mechanics. There is a 
widespread notation and many small variations. 


7.1 Equations of Motion 

From classical mechanics, we have the Newton-Euler equations (see Section 22.1 for details), describing 
the motion for all bodies. For the i’th body, the mass of body i is given by m, and the inertia tensor by I, , 
the position is given by r % and the velocity of the center of mass as v t , the orientation is represented by 
the quaternion q t and the angular velocity by u> % . The Newton-Euler equations for the Tth body look like 
this (summations are taken over all contact points): 


ri = Vi (7.4a) 

Qi = \u%q% (7.4b) 

*i = mi 1 fk ~ mi 1 fk + m 7 ] ft eXt <7- 4c ) 

jk = i 

= i~ l r kj x fk - ir 1 rki x fk (7-4d) 

jk = i 'ik = 'i' 

- Ii l LVi x IiUJi + I^Ti ext (7.4e) 


The dot-notation means the total time derivative ^ and is used to ease readability. Observe that f k denotes 
the contact force at the fc’th contact. For the time being, we will ignore joints and motors. The effect of all 
external forces on the center of mass is given by f t ext and the total torque from external forces is given 
by r l ext . 

For notational convenience, we introduce a contact table. Consider a total of K contacts, and assign 
a unique number k to each contact. For each contact, we know the indices i k and j k of the two incident 
bodies. We use the convention that i k < j k . We also have a contact normal n k and a contact point p k both 
specified in the world coordinate system, and with the convention that the contact normal points from the 
body with the smallest index to the body with the largest index. This is illustrated in Figure 7.1. Note that 
we can never have i k = j k . For each contact we can compute a vector from the center of mass, r of an 
incident body with index i, to the point of contact p k , that is, 


rki =Pk~ r i 


(7.5) 
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Figure 7.1: The contact normal convention and notation. 


The Newton-Euler equations can now be written as 

s = Su (7.6a) 

u = M~ l ( CNf + f ext ) . (7.6b) 

Now we will introduce some matrix notation, which will allow us to write the Newton-Euler equations for 
all bodies in a single equation. The position and orientation of n bodies may be concatenated into a single 
generalized position and orientation vector, s 6 M 7n : 

s = [r 1 ,q 1 ,r 2 ,q2,--- ,r n ,q n ] T ■ (7.7) 

Similarly, we can write the generalized velocity vector u 6 M 6n as 

u=[vi,u)i,v 2 ,u>2,--- ,v n ,u} n ] T . (7.8) 

For the time being, the frictional effects will be ignored, implying that the contact force can be written as 

fk = fk.n k . (7.9) 

This means that we only need to remember the magnitude, f k , of the normal force, and these can now be 
concatenated into a single vector f e R k 



(7.10) 
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The external forces, torques, and velocity-dependent forces can also be concatenated into a vector, / ext G 

M 6n , 

l T 


fext = [/l eX S T 1 6X1 "WlX 1 \U>\, ■■■ , / n 6Xt , T n eXt - U n X I n U r 
Given q, = [.sy, ay, y ( , z,] T G M 4 , we can write the rotation as a matrix Q i G M 4x3 as: 


Q t = 


(7.11) 


-Xi 

~Vi 

-Zi 

Si 

Zi 

~Vi 

-Zi 

Si 

Xi 

Vi 

-Xi 

Si 


(7.12) 


where = Q ( aa,, as shown in Proposition 18.47. The rotations can now be concatenated into a 


matrix S G 


x6n 


5 = 


Qi 


o 


Qr 


(7.13) 


Matrix S' is also illustrated in Figure 7.2. The generalized mass matrix M G 


) 6n x 6n 


IS 


M = 


mil 


m„l 


(7.14) 


where 1 is the identity matrix. The layout of the mass matrix is illustrated in Figure 7.3. The matrix of 
contact normals N G R. 3KxK is 


N = 


n i 


n 2 


(7.15) 


0 

as shown in Figure 7.4, and the matrix of contact conditions C G K 6nx3i ^ 




is 


-1 

for l 

= 2i fc -l 

~r x 

kik 

for l 

= 2 4 

l 

for l 

= 2,4 - 1 

r x . 

kjk 

for l 

= 2,4 

0 

otherwise 


(7.16) 




Figure 7.2: The S matrix layout. 


Here r x £ M 3x3 is the skew-symmetric matrix given by 

0 -r 3 r 2 

r x = r 3 0 -n . (7.17) 

—r 2 r'i 0 

It is easy to show that r x a = r x a (see Chapter 18). Every column of C corresponds to a single contact 
and every row to a single body (see Figure 7.5). 

Using an Euler scheme as described in Chapter 23, we can write the discretized equations of motion 
as follows, 

S t+At = s t + A tSu t+At , (7.18a) 

U t+ At = u t + A tM~ l (CNf t+At + f ext ) . (7.18b) 

Here, superscript denotes the time at which a quantity is computed. Note that the matrices depend on time 
through u and s. If they arc evaluated at time t, then we have a semi-implicit method, and if they arc 
evaluated at time t + At, then we have an implicit method. 







“book” — 2005/9/30 — 15:44 — page 188 — #200 



188 


CHAPTER 7. CONSTRAINT-BASED MULTIBODY ANIMATION 








“book” — 2005/9/30 — 15:44 — page 189 — #201 


7.2 PHYSICS-BASED ANIMATION 


189 



Figure 7.5: The C matrix layout. 


7.2 The Contact Condition 


The projection matrix, P k G M ,Wx3 will be needed for further analysis of the /::’th contact point. Its 
transpose is defined as 



'0 

0 

o' 


0 

0 

0 


o 

0 

o 


0 0 0 
0 0 0 
0 0 0 


1 0 0 
0 1 0 
0 0 1 


0 0 0 
0 0 0 
0 0 0 


'0 

0 

o' 


0 

0 

0 


o 

0 

o 



(7.19) 


That is, the A:’th 3x3 submatrix is set to the identity matrix. The normal component of the relative contact 
velocity of the fc’th contact point is given by 

n k P kC T u = nl (v jk + u jk x r kjk ) - (v ik + u ik x r kik ). (7.20) 

Notice that multiplying by the projection matrix will mask out the k’th contact conditions. If body Bi k 
and Bj k touch at contact point p k at time t,, then the complementarity condition for the velocities must 
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hold 


n^P r [C T u t+At > 0 compl. to f k > 0. 


(7.21) 

A complementarity condition means that if we have two conditions, then one is nonzero and the other is 
zero or vice versa. Stewart and Trinkle [Stewart et al., 1996] originally used a complementarity condition 
on position. Anitscu and Potra [Anitescu et al., 1997] discovered that using the velocity complementarity 
problem guarantees solution existence. Sauer and Schomer [Sauer et al., 1998] expanded the velocity 
formulation further by handling future potential contact points. 

If there is no contact at the potential contact point p k at time /, then the following linearized comple¬ 
mentarity condition holds: 

Vk 


n T k P T k C T u t+At >^- t compl. to f k > 0. 


(7.22) 


Later we will go into details on the linearizing. If we use u k = 0 for all touching contacts, then we can 
formulate a complementarity condition for all contacts (touching and potential ones) as 

v 


N T C T u t+At > compl. to f > 0, 
with v = [vi ,..., vk] T £ - Inserting (7.18b) into (7.23) gives 

N t C t (u f + A/M- 1 (CNf t+At + f ext )) - ^ > 0. 

Rearranging yields 

N t C t M^CN A tf +At + N t C t {u l + A tM~ l f ext ) - > 0, 


At 


(7.23) 


(7.24) 


(7.25) 


which results in a linear complementarity problem ( LCP ) (see Section 19.10) of the form 

Ax + b > 0 compl. to x > 0, 
where A € Vi Kx K and x. b G M A . Entry /, k of A is 

T 


A-lk — biii k n i 


^hjk n l 


-s. 


jbk n l 


T 


+ 5 




1 

1 

- r x /r 1 r x . 

Hi i k tci k 

m ik 

_L 

1 

1 

- r x /“ 1 r x . 
til 3k tcj k 

m h 

-L 

1 

1 

— r x J- 1 r x 

m ik 

-L 

Ijl ik ki k 

1 

1 

- r x /- 1 r x - 


n k 


n k 


n k 


tt-kt 


with the Kronecker symbol being 


1 for i = j, 


(7.26) 


(7.27) 
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7.3 Linearizing 

Sauer and Schomer [Sauer et al., 1998] use a lincari/cd contact condition in (7.23). In the following, we 
will derive the linearized contact condition. The linearization serves as a measure of when a potential 
contact constraint should be switched on. This allows bigger time-steps to be taken while keeping the 
error low. Taking the same time-step size without the linearization will imply a larger approximation 
error. 

The /c'th potential contact point may be represented by the closest points p ik and p n between two 
bodies, Bi k and Bj k , which eventually meet and form the fc’th contact point. The closest points depend on 
the position and orientation of the bodies. If we let the vector s k 6 M 14 be the generalized position vector 
of the two bodies, where s k s function dependency of time has been omitted for readability, 

Sk = [r ik ,qi k ,r jk ,q jk \, (7.29) 

the minimal distance between the two bodies, d k (s k ), is 

dk(s k ) = nl(s k ) (; p jk -p ik )> 0, (7.30) 

where n k is a unit vector pointing from p ik to p - fc . A first-order Taylor-expansion of d k (s k ) at s k ' is 

40fc) = 4(0 + (y Sk d k (s k ')) T (s k - 8 k ) + 0(At 2 ). (7.31) 

Notice that V. Sfc is the functional derivative introduced in Section 18.4. If we look at the backward differ¬ 
ence of the time derivative of the generalized position vector, we find 

r l , s c t+At- _ t 

Jt( S C A ‘)= k Ai k +0(A » (7 ' 32) 

Rearranging yields 

s k t+At = s k + j t ( s k t+At ) A t + 0(At 2 ) (7.33) 

Again, we approximate V Sk d k (s k ') at $ k t+Al using Taylor’s Theorem by taking the zeroth order expan¬ 
sion to get 

V Sk d k (s k ')=X7 Sk d k (s k t+At ) + 0(At). (7.34) 

Substituting (7.33) for s k in (7.31) gives 

dk(s k t+At ) w 4(V) + (' Va k d k (8 k ')) T At - s k '^J (7.35) 

= d k (s k ') + (V Sfc 4(0) — s k 0 

+ At (V Sk d k (s k ')) T j f (s k t+At ) . (7.36) 
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Now we insert (7.34) in the last term and get 


dk( s k t+At ) ~ <4( s fc0 + (V Sk dk{s k ')) — s k ') 


(7.37) 


+ A((v. 1 4K' +A ')) T |( St ‘ +A '). 

Recall that the distance function is actually a function of time, dk(s k (t)), so by the chain rule we have 


(*(4 +A ')) = (v 4(4 +a *)) t 4 (4+ a <). 


dt 


— d k (s 


Inserting (7.38) into (7.37) yields 


d k(s k t+At ) « d k (s k ') + (V ak d k (s k ')) ( s k f - s k ) 


+ At 


dt 


t+At\\ 


> 


From past work such as [Baraff, 2001, Bara IT, 1994], we know 

j t (4(0) = nl ((' v jk + u jk x r jk ) - (v ik + u ik x r ik )). 

Using this in (7.39) together with (7.30), we derive 

dk( s k) + (y s k dk(s k ')) (s k f — s k ') 

+ A tnl 


Rearranging, we have 


N +i ‘+"7 +a ‘ >< N +A ‘ 


T 


v 1 - t+At + u;, t+At xr, t+At 

3k 3k 3k 


) - (t),‘ +At + «, t+At xr,‘ +At )) 








77 


- (4(0 + (V Sk dk(s k ')) (V — «*/))• 


Recall that the left side of this equation is in fact n k P k C T u *+ At . It now follows that 


n k^k C T u t+At > (dk(s k ') + (V Sk 4(0)" (O - O) • 
Comparing with (7.22), we write 


1 

At 


(7.38) 


(7.39) 


(7.40) 


(7.41) 


(7.42) 


(7.43) 


v k — ~ (dk{s k ) + (y Sk d k (s k ')) (sfc* — s fcO) • (7.44) 

All curvature information is lost with the linearized constraints, which implies that a step of length O(At) 
introduces errors of 0(At 2 ). Flence the approach of Sauer and Schomer prevents the penetration of 
increasing to more than 0(h 2 ). 
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Figure 7.6: The friction pyramid approximation. We have chosen rj = 6; observe that the d/ lk ’s positively 
span the friction pyramid. 


7.4 The Frictional Case 


In this section we will expand the formulation given in (7.26) to include friction. For each contact, we use 
two orthogonal unit vectors t\ k and t- 2 k , which span the tangential plane at the k ’th contact. Together with 
the normal vector n k , the three vectors form an orthogonal coordinate system (see Definition 18.6). The 
friction cone at the k' th contact is approximated by a discretized version having r/ direction vectors d ,/ tfc 
with h = 1,..., r), where 7] = 2 i for all / £ N and i > 2. The direction vectors arc concatenated into a 
matrix D k eM 3xi) 

D k = [d lk ,..,,d Vk ], (7.45) 


where 


dh, k — 


= cos 


2 (h — 1)7T 
V 


ti, + sin 


2(h — 1)7T 
V 


^2i- 


(7.46) 


We have transformed the spatial cone limiting the friction force due to Coulomb’s friction law and called 
the friction cone into & friction pyramid with ?/ facets, as illustrated in Figure 7.6. For each direction vector 
we will use fh k for the magnitude of the component of friction force in the direction of dh k . As before, 
we can build up a vector of all friction components f3 k £ W 1 , 


Pk = [Pi k ,---,Pvk] T ■ 


(7.47) 


The modification of the equations of motion (7.6) is the definition of contact force f k from (7.9), which 
we now write as 


f k — fk n k + J-^kPk- 


(7.48) 
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D = 


eta K 



Figure 7.7: The D matrix layout. 


As before, we use matrix notation, which will allow us to write the equations of motion in a single matrix 
equation. The generalized acceleration is again described by the Newton-Euler equations 

u = M~ l (C (Nf + Dp) + f ext ). (7.49) 

We need a vector, /3 € BA 1 ' 

(3= [/3f,...X-,] T (7.50) 

That is the concatenation of the /3/.-vectors; we also need the matrix D e rzkxvk 


D = 


D i 


D, 


D 


I< 


(7.51) 


Figure 7.7 illustrates the D matrix layout. Using an Euler scheme gives us the discretized approximation 

u t+At = u t + A/M- 1 (C (Nf + Dp) + f ext ). (7.52) 


In order to model the relationship between the normal force and the friction, known as Coulomb’s friction 
law, we need to add two complementarity conditions for the friction forces in addition to the previous 
complementarity condition for the normal force. We have a total of three complementarity conditions for 
the A’’th contact: 


A k e k + D T k P T k C T u t+At >0 

compl. to 

/3fc A 0, 

(7.53a) 

Mfc/fc - elP k > 0 

compl. to 

A k > 0, 

(7.53b) 

n T k P T k C T u t+At — ^ > 0 
k k At ~ 

compl. to 

fk > o. 

(7.53c) 


where Hk is the friction coefficient at the k "th contact point and e& = [1,..., l] T e W r] . The symbol A/,- 
is a Lagrange multiplier with no real physical meaning, but it is an approximation to the magnitude of the 
relative tangential contact velocity. Possible contact states modeled by (7.53) arc: 
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Separation: In this case, n k Pj(C T u t+At — ^ > 0, and (7.53c) implies that f k = 0. Substitution of 
this into (7.53b), implies that (3 k = 0, i.e. there is no friction force. From (7.53a) we see that X k 
can take on any value without violating the conditions. 

Sliding: For sliding, D[ P{ : C r u t+At is nonzero, since the columns of D k positively span the entire 
contact plane. There must be at least one direction vector such that dj( k Pj. C T u t+At < 0, and 
since the corresponding / 3h k > 0, we must have X k > 0 for (7.53a) to hold, and (7.53b) implies that 

fthfr — dkfk- 

Rolling: In this case, D k P' k C T u t+At is zero, and (7.53a) implies that X k > 0. There arc two interest¬ 
ing cases: 


Case 1: Choosing X k = 0 (7.53a) implies that (3 k > 0. This means that the contact impulse can 
range over the interior and the surface of discretized friction cone. 

Case 2: Choosing A/,. > 0 (7.53a) implies that (3 k = 0, (7.53b) will only be fulfilled if p k f k = 0. 
This is a nongeneric case that occurs by chance in the absence of & frictional impulse, that is, 
when fj, k = 0. 


We can now proceed analogously to the frictionless case and tty to insert (7.52) into (7.53a) and (7.53c): 

X k e k + D T k P T k C T (u t + A/M- 1 (C ( Nf + D{3) + / ext )) > 0 

compl. to (3 k > 0, (7.54a) 

n T k P T k C T {u 1 + A/M- 1 (C (Nf + D(3) + / ext )) - ^ > 0 

compl. to f k > 0. (7.54b) 


Rearranging provides us with two new complementarity conditions, which replace those in (7.53a) and 
(7.53c): 


AtDlPlC T M~ 1 CNf + AtD k P'J(c T M~ 1 CD/3 
+X k e k + D T k P T k C T u 1 + A tD T k P T k C T M~ l f exi > 0 

compl. to (3 k > 0, (7.55a) 

A tnlPlC T M~ l CNf + AtnlPlC T M- l CDf3 

+n T k P T k C T u t + A tnlP T k C T M~ 1 f ext - ^ > 0 

compl. to f k > 0. (7.55b) 
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By rearranging the complementarity conditions (7.53b), (7.55a), and (7.55b), we can formulate the LCP- 
formulation (see Section 19.10) on matrix from as 


D t C t M~ 1 CD D 1 C 1 M l CN E At/3 
N t C t M~ 1 CD N t C t M~ 1 CN 0 • A tf 

— E^ fJ, 0 A aux 

D T C T {u t + AtM~ 1 f ext ) 

+ N T C T (u' + AtM-'f^--^ >0 

0 

'At/3' 

compl. to Atf > 0, 

A I \ 


■>T r^T li/r-l i 


(7.56) 


where the diagonal matrix /r, <E 


is given as 


and the matrix e e rvKxk is given by 


(7.57) 


(7.58) 


That is, E consist of IRA dimensional submatrices. All submatrices on the diagonal consist of ones and of 
diagonal submatrices that are 0 (see Figure 7.8). Finally, the vector A aux £ M A is given as 


A a ux — [A 1; • • • i A/f] 

Let the matrix A £ r( j i+ 2 ) Kx ( 1 ^+ 2 ) k be defined as 

D t C t M 1 CD D t C t M- 1 CN e 
a= N T C T M~ l CD N t C t M~ 1 CN 0 , 


(7.59) 


(7.60) 


and the vector x £ r0?+ 2 ) a as 


'At/3' 

x = Atf , 

Aaux 


(7.61) 
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Figure 7.8: The E matrix layout. 


and the vector b E R^+ 2 ) K as 


b = 


D T C T (u t + AtM~ 1 f ext ) 
N t C t (w t + AtM^fJ) - ^ 
0 


then we see that we have a typical LCP formulation (see Section 19.10) of the form 


Ax + b > 0 compl. to x > 0. 


(7.62) 


(7.63) 


The above formulation can further be extended to include torsional friction [Trinkle et al., 2001]. 

Because of real-time demands, a scalable friction model for time-critical computing is important; the 
constraint-based method is easily adopted to a scalable friction model by controlling the number of facets, 
tj, used in the friction pyramid approximation. 

Several methods could be used for setting the value of 77 ; a global control could be used based on the 
amount of computation time or the total number of variables in the LCP problem (see Section 19.10). If 
either of these exceed some given limits, 77 is decreased correspondingly. 

However, local control could also be used. Often only visualization is important; accurate friction is 
therefore only needed for objects seen by a user. In such cases, it is reasonable to use a low 7/ for all 
objects outside the view-frustum, and for those objects inside the view-frustum a higher 77 value is used. 


7.5 Joints 

In the previous sections we have treated the problem of contact mechanics using classical mechanics 
taught in first-year undergraduate physics and linear algebra. The approach is straightforward and easy 
to understand even though there are many symbols and much notation. Until now we have treated what 
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is called unilateral contacts , where unilateral refers to the “>”-constraints on the contact forces. In this 
section, we will tty to generalize our formulation and include bilateral constraints. Here bilateral refers 
to a “=”-constraint on the constraint forces. Bilateral constraints arc used for modeling joints between the 
bodies, such as hinges and ball-in-socket connections. 

In this section, we will show how we can go from the formulation based on classical mechanics into a 
formulation based on analytical mechanics. To achieve a more abstract and general formulation, we need 
to introduce concepts of holonomic and non-holonomic constraints and Jacobians. 

The analytical mechanics approach for complementarity problems has been studied amply in the liter¬ 
ature, [Pfeiffer et al., 1996b, Anitescu et ah, 1996, Anitescu et al., 1997, Anitescu et al., 2002]. A useful 
reference for further reading on analytical mechanics is [Goldstein et ah, 2002]. 

7.5.1 Holonomic Constraints 

Working with constraints, we arc particularly interested in the number of degrees of freedom (DOF), that 
is, the minimum set of parameters needed to describe the motion in our system. For instance, a free- 
moving body has six DOFs because we need at least three parameters to describe its position and at least 
three parameters to describe its orientation. For two free-floating rigid bodies we have 12 DOFs, from 
which we conclude that the smallest possible generalized position vector we can find will have 12 entries. 

Following the conventions from previous sections, the spatial position vector si =€ M 14 for the V th 
joint between the two bodies B l( and Bj t can be written as 

si = [hpftiTipfef ■ (7-64) 

To facilitate notation we will not bother to write the subscript indicating the joint number or the contact 
number in the following sections, so we simply write 

s = [r i ,qi,r j ,q j ] T . (7.65) 

The position vector s is not the minimum set of parameters, since we use quaternions for the represen¬ 
tation of the orientations, and thus use four parameters instead of the minimal three for each orientation. 
For describing velocities and accelerations we could use time derivatives of the quaternions, but this is 
tedious, since the physics laws use angular velocities u. Instead, we need a transformation like the one 
we introduced in Section 7.1 

s = Su, (7.66) 

where 

u = [v i ,u>i,Vj,uj j ] T , (7.67) 


and 
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We write the position vector r e M 12 associated with the integrals of u as 

r = [r i ,e i ,r j ,e j } T , (7.69a) 

= [xi,yi,Zi,a i ,f3i,'Yi,x j ,y j ,z j ,a j ,(3 j ,'y j ] T . (7.69b) 

Here 0, is the integral quantities of u r , that is, 


u = 



(7.70) 


In general, the quantities 0, and 9j in r do not give meaning as finite quantities, and in plain computer 
graphics language you cannot use them like Euler angles to make a rotation matrix. Nevertheless, r is a 
minimum spatial position vector. 

When we link two rigid bodies together by a joint, then we are removing DOFs from the system, 
and we can therefore find an even smaller generalized position vector. For instance, if we make a rigid 
connection between the two free-floating bodies, then we can remove six DOFs because we only need to 
describe the movement of one of the bodies, and the movement of the other body will follow immediately 
from the movement of the first body. This means that the smallest possible generalized position vector has 
six entries. From the example we see that we can at most remove six DOFs from any joint. 

By definition, a holonomic constraint between two bodies Bi and B } can be written as a function <f> 
of time and spatial position vector s =G M 14 , such that we always have 


s) = 0. 


(7.71) 


All joint types presented by us can be modeled by time-independent holonomic constraints, meaning that 
for the Z’th joint we have m holonomic constraints 


$i(s) = 0, (7.72a) 

$ 2 (s) = 0, (7.72b) 


$m(s) = 0, (7.72c) 

where m is the number of degrees of freedom removed by the constraints. This type of holonomic con¬ 
straint is called a scleronomous constraint. 

Assume that the Z'th joint is a ball-in-socket joint between the two bodies Bi and Bj. A ball-in-socket 
joint is characterized by the fact that two points, one from each body, arc always connected to each other, 
meaning that we have one constraint saying the .c-coordinatcs of the two points must be equal, another 
constraint requiring equality of the y-coordinatcs, and a third one for equality of the ^-coordinates. That 
is, if we let the two points be specified by two fixed vectors r? lllt . and r{ n c in the respective body frames of 
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Figure 7.9: A ball in a socket joint in 2D. 


the two bodies as shown in Figure 7.9, then we can formulate the geometric constraint as follows 


(rq + R( qi )r l mc ) - (rj + R{q j )ri nc )] x = 0, 

(7.73a) 

$1 

[(n + R (<n) r mc) - ( r j + R (Qj) r l nc)]y = 0, 

(7.73b) 

$2 

[(»■* + R (Vi) r lnc) - { r j + R (Vj) r L c)] 2 = 0, 

(7.73c) 


$ 3 


where R(q) is the corresponding rotation matrix of the quaternion q. From the equations above it is clear 
that the geometric constraint characterizing the ball-in-socket joint can be expressed as three holonomic 
constraints on vector form as 


*(s) 


$l(s) 
$2 (S) 

$ 3 ( 8 ) 


= 0 . 


(7.74) 


Note that the small example is not only illustrative; it actually provides us with a recipe for deriving 
different joint types. In conclusion, a holonomic constraint is equivalent to removing a degree of freedom 
from the system, which means that we can find a generalized position vector with one less entry than the 
spatial position vector. 

By differentiation with respect to time, we can derive a kinematic constraint from each holonomic 
constraint 


d<& ds 

ds dt 

(7.75a) 

u 

os 

(7.75b) 

J $ 

0. 

(7.75c) 
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The matrices G R mx 14 and J G R mx 12 arc called Jacobians; they describe relations between 
velocities in different coordinate representations. Finally, we have the kinematic constraint 


J §ui = 0. 


(7.76) 


Performing another differentiation w.r.t. time leads us to the acceleration constraint 

= (7.77a) 

= ^(J$)u +(7.77b) 
= 0, (7.77c) 

from which we conclude that 

= — J^,u. (7.78) 

For our velocity-based formulation we have no use for the acceleration constraint; however, if we set up a 
force-based formulation, we would need to augment the Newton-Euler equations with these acceleration 
constraints. 

It is well known that the generalized constraint force exerted by a holonomic constraint can be written 
as 

= J% A*. (7.79) 

This follows from the principle of virtual work described in Section 22.6. The A,j> G R m is a vector of 
Lagrange multipliers. They account for the reaction forces coming from the joint bearings; the Lagrange 
multipliers can take any real value, both positive and negative. Observe that the dimension of A$ depends 
on the number of constraints on the joint. Therefore, we conclude that we have as many independent 
reaction forces as there arc constraints. 

7.5.2 Non-Holonomic Constraints 

A non-holonomic constraint is a differential constraint that cannot be integrated; however in this context, 
we define a non-holonomic constraint as a constraint that cannot be put into the form of a holonomic con¬ 
straint (7.71). There arc many different kinds of non-holonomic constraints, and we will restrict ourselves 
to a certain kind, namely those called unilateral constraints. 

The non-holonomic constraint between two bodies Bi and Bj can, by definition, be written as a 
function T of time and generalized position vector s G R 14 , such that we always have 

&(t,s)> 0. (7.80) 

The condition for a contact can be modeled by (1 + 77 ) time-independent non-holonomic constraints, that 

is, 

Tf(s) > 0. (7.81) 
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This looks very different from the contact conditions we have seen previously, but don’t be concerned; the 
connection with our previous derivations will be made clear later on. Taking the time derivative gives us 
a kinematic contact constraint. 


d T . . tAP ds 
dt 1 ’ ds dt 

(7.82a) 

d* 

^ds^ 

(7.82b) 

J \j/ 


= JqiU 

(7.82c) 

>0, 

(7.82d) 


where J# G R( 1+r 7) x12 is the Jacobian of the contact constraint. Taking the time derivative one more time 
yields an acceleration constraint 


d? 

d , 



= s ( j„ u ) 

(7.83a) 


— J tyVi + J xj/U 

(7.83b) 


> 0 . 

(7.83c) 


The generalized constraint force exerted by the contact constraint can be written as 


— J^j Avj/, (7.84) 

where A v j, G M 1+?? is the vector of Lagrange multipliers. Unlike the Lagrange multipliers used for the 
joint reaction forces, the Lagrange multipliers for the contacts can only take nonnegative values, that is, 

A* > 0. (7.85) 

As pointed out by Anitscu and Potra [Anitescu et al., 2002], one should be careful about the constraint 
4®(a) > 0, because if we have 

¥(s) > 0 (7.86) 

indicating a potential future contact, this does not imply that 

^*(a) > 0. (7.87) 

Only so-called active contacts, where 

’P(s) = 0, (7.88) 

require this. This boils down to the fact that separated contacts arc allowed to move toward each other 
until they become touching contacts, and a touching contact can either continue with being a touching 
contact, or it can become a separated contact. 
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Momentarily reintroducing the subscript for the contact ordering, we can write all the kinematic con¬ 
straints and force contributions as 


J. y k Uk > 0, where Jy k G ]R( 1 + ? t) xl2 ) 
fy k = where A^ fc G R 1+v . 


Concatenating them into matrix equations, we have 

J contact^ = 0, 


/ c 


= J 


contact '''contact; 


(7.89a) 

(7.89b) 


(7.90a) 

(7.90b) 


where u G Re 6n is the generalized velocity vector (Section 7.1), u = [vi,uj i,V2, u>2i • • • , v n , u n ] T , and 
Acontact € 14 ' / ' 1 is the concatenated vector of all the Lagrange multipliers. 


A^, ,..., A 


(’ 7 + 1 ) 

v&i ’ ' ' ' > ' X '&K 


A«f K , • • •, A 


(’7+1) 


l T 


(7.91) 


The J contact £ 11 + r d x j s p lc S y S tem Jacobian for all the contacts, and it is given by 


' contact 


T1 

J >I'i 


ri 


T1 


7* 


17 +fc 


T n 




j-n 


(7.92) 


This Jacobian is extremely sparse since the k ’th contact only involves two bodies i and j, meaning that 
the only nonzero entries in the A:'th row of J con tact are the columns corresponding to the bodies i and j 


J 


I 3 

k J Vk 




(7.93) 


We will now prove that 


C (N f + Df3) = J contact A con tact- (7.94) 

The above equation follows from straightforward computations and permutations of the left-hand side of 


C (Nf + D(3) = CNf + CD/3 
= [CN CD] 


(7.95a) 

(7.95b) 


( JT \ 

V u contact J _ / \ \ 

7T Acontact ) 


— 7r (^contact^contact) ; 


(7.95c) 
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where 7r(-) is a permutation. Now we simply swap rows and columns, such that 

= fk1 

= ^lfc) 


(7.96a) 

(7.96b) 


and the relation between the Jacobian, 


J 


A 


contact ? 


07 + 1 ) _ a 

<1 > k ~ i 

and the matrices C, N, and D is clear. 


(7.96c) 


7.5.3 A Unified Notation for Unilateral and Bilateral Constraints 


Now we’ll show that both bilateral and unilateral constraints arc added to the govern system of equations 
of motion through the same notion of Jacobian and Lagrange multipliers. 

Momentarily reintroducing the subscripting on both the joint and contact ordering, we have K contacts 
and L joints, and we can write all their kinematic constraints and force contributions as 


J<f> t ui = 0, where (7.97a) 

J^ k Uk>0, where Jq, k G m( 1+, ?) x12 ; (7.97b) 

= A#,, where A (7.97c) 

fq, k = J% k Ay k , where A^eM 1 ^. (7.97d) 

Following the same recipe as in Section 7.5.2 for concatenating these into matrix notations, we get 

J joint = 0, where J joi „ t G (7.98a) 

Jcontact«>0, Where Jcontact G R^ (1+?7)><6? \ (7.98b) 

/joint = J jdnt A joint, where Ajoint G (7.98c) 

/contact = 'Contact -Contact) where A con t ac t GM ^ + (7.98d) 


The Jacobian J con tact and the Lagrange multiplier vector A conta ct was given in (7.92) and (7.91). The 
system joint Jacobian Jj 0 i nt and the joint Lagrange multiplier vector Aj 0 i nt follows the same pattern and is 
given as 



J 


n 

$1 


J joint — 




l J l L 


U 


jn 

J $L J 


(7.99) 


This Jacobian is inherently extremely sparse, since the Tth joint only involves two bodies i and j, meaning 
that the only nonzero entries in the Tth row of Jj 0 i nt arc the columns corresponding to the bodies i and j 
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and 

©oint — 

Using the matrix notation to write constraint forces of both bilateral and unilateral constraint, the general¬ 
ized acceleration vector can be written as 


\1 \®l 

A$, , . . • , A 


$1 i 


\ 1 

, A<j 


<&K' 


\ m K 


(7.101) 


V' ^ (f contact 

= M~ 1 f T t 

iv± yo contact 


f joint 3” f ext) 


+ /ext) • 


(7.102a) 

(7.102b) 


This is a completely general way to add constraints; it will be further explored in the remainder of this 
chapter. In the end it will also lead to a general and efficient implementation framework. 


7.6 Joint Modeling 


In this section we will derive the machinery for modeling joints and later joint limits as well as joint 
motors. We will staid by introducing a submatrix pattern of the Jacobian matrix. Hereafter, we will 
describe joint error, connectivity, and error reduction. 

For the I'th joint constraint, we can write the kinematic constraint as 

Jiui = 0. (7.103) 


Since we will focus on joint types, we will omit writing the subscript indicating the joint ordering, that is, 
for a given joint type we simply write the kinematic constraint as Ju = 0. There is a remarkable submatrix 
pattern of the Jacobians, which we will use extensively, because later on it will make the assembly of the 
system matrix easier, that is, A. Writing the generalized velocity vector with its subvectors as 


Ju = 0, 





Vi 

u>i 


= 0. 


(7.104a) 

(7.104b) 


Observe that there is a part of the Jacobian matrix that is only multiplied with the linear velocity of body 
i, which is denoted J j in , a paid that is only multiplied by the angular paid of body i, J j ng , and so on. In 
fact we can interpret 

©in v i + J^gUJi, (7.105) 

as the velocity of the joint bearings on body i, and 

©in v j + ©mg u j 1 (7.106) 


as the velocity of the joint bearing on body j. It is now obvious that in order to keep the joint bearings 
together, the bearings must move with the same velocity and the sum must therefore be zero. This obser¬ 
vation provides us with a strategy for designing the Jacobians: given the body velocities, set up a matrix 
equation, such that the relative velocity in the direction of the joint bearings is always zero. 
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7.6.1 Joint Error 

By now it should be clear that the kinematic constraints arc constraints on the velocities, not the positions; 
meaning that both numerical errors and errors stemming from internal approximations can sneak into 
the computations of the positions as the simulation proceeds. Imagine that some positional error has 
occurred, such that there is a positional displacement of the joint bearings and/or a misalignment of the 
joint bearings. This error could be reduced by adjusting the velocities of the joint bearings, such that the 
error is smaller in the following simulation step. Therefore we augment our kinematic constraints with a 
velocity error collection term, b 

Ju = b. (7.107) 

To illustrate, we will present a simple one-dimensional example: imagine two particles that can move 
along a line, where the particles arc jointed together, such that their positions arc always equal. Their 
kinematic constraint will then be 

Vi — Vj = 0. (7.108) 

Now imagine that some error is present 


r ea = rj-r i , (7.109) 

with 11T err 11 > 0. To adjust the velocities so that this error is eliminated within some time At, we require 
that 

(7.110a) 
(7.110b) 

If joint or limits is subject to an initial error and incident links arc at rest, then error terms will acceler¬ 
ate the links; so not only will the error be corrected, but also bodies will continue to move afterward. This 
is obviously an unwanted effect! 

The error correction should not add kinetic energy to the system. In fact, the error correction has 
the same effect as using Newton’s collision law for solving simultaneous collisions [Baraff, 1989]. An 
acceptable and practical workaround is to use an error reduction parameter to control the rate of error 
correction, which will be discussed in Section 7.6.3. 

7.6.2 Connectivity 

We will describe the connectivity and movements of all joints by using anchor points and joint axes. An 
anchor point is a point in space, where two points, one from each incident body arc always perfectly 
aligned. The placement of an anchor point relative to a body, i, is given by a body frame vector, r an ' c . The 
position of the anchor point in the world coordinate system (WCS) w.r.t. to body i is given by 



Ju = b. 
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A joint axis describes an allowed direction of movement, such as an axis of rotation or a direction of 
sliding. The joint axis is given by a 3D unit vector, s ax " cs . In Section 7.7 we will explain the details in 
describing different joint types using the notation of anchor points and joint axes. 

This way of describing the connectivity is very similar to the paired joint coordinate frames described 
in Featherstone [Featherstone, 1998]. In comparison, anchor points correspond to the placement of the 
origin of the joint frames and joint axes correspond to the orientation of the joint frames, such as the z-axis 
of the joint coordinate frame. Alternative notations for describing the connectivity of jointed mechanisms 
arc used in some literature [Featherstone, 1998, Craig, 1986]. 

7.6.3 Error Reduction Parameter 

The kind of approach for simulating joints that we arc outlining in this paper belongs to a class of algo¬ 
rithms referred to as Full-Coordinate methods because every body in a jointed mechanism is described by 
the full set of rigid body motion coordinates. 

An alternative approach is the Reduced Coordinate methods, where a good example is Featherstone’s 
algorithm [Featherstone, 1998]. The central idea is that only the relative motion of bodies between joints 
needs to be described; only the relative coordinates of the joints arc therefore needed. 

The main difference between the two approaches is that Reduced Coordinate methods explicitly work 
with joint parameters. The position and placement of the links arc derived from these joint parameters. 
With a Full-Coordinate method, we work explicitly on the links and we need to derive joint parameters if 
needed. There arc some benefits and disadvantages of these methods, which we will describe shortly. 

The Reduced Coordinate methods arc often computationally faster, since they have fewer variables 
to work on, and since they arc often implemented by recursive algorithms like Armstrong and Feather¬ 
stone [Armstrong et al., 1985, Featherstone, 1998]. These recursive algorithms arc often limited to tree¬ 
like mechanisms and only with great difficulty can these recursive algorithms be extended to handle closed 
loops and contacts. 

The Full-Coordinate methods arc not limited by any kind of topology, but they arc often more com¬ 
putationally demanding, because they must describe all the constraints on each link’s rigid body motion. 
Reduced Coordinate methods need only to describe the free movement, which is often of lesser dimension. 

Many people prefer the Full-Coordinate methods because they think the notation is easier to read and 
work with. Reduced Coordinate methods appeal - to have long and difficult terms representing coriolis 
and centripetal accelerations. 

From a computer animation viewpoint, numerical errors in a Full-Coordinate method seem to be much 
more noticeable than in a Reduced Coordinate method. This is because errors in the body coordinates will 
split joints apart and introduce an effect called drifting because links that supposedly should be jointed 
together are drifting apart. Reduced Coordinate methods do not suffer from the drifting problem, since no 
matter how big numerical errors one obtains, the simulation will always show bodies connected properly. 

In conclusion, with Full-Coordinate methods we can expect drifting problems; there are two ways 
these can arise in a working simulator: 

• The user interacts with a mechanism and forgets to set the correct position or orientation of all the 
links in a mechanism. 
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• During the simulation, errors can creep in that result in the links drifting away from their joints. 

In Section 7.7, we describe the kinematic constraints of different joint types and we will introduce some 
error correcting terms. These arc all multiplied by a coefficient, k con , which denotes a measure of the rate 
of error correction. The idea is as follows: for each joint we will have an error reduction parameter, k erp 

0 < Awp < 1. (7.112) 

Its value is a measure for how much error reduction should occur in the next simulation step. A value of 
zero means that there is no error correction at all; a value of one means that the error should be totally 
eliminated. 

If we let the duration of time in the next simulation step be denoted by a characteristic time-step. At, 
then the following constant is a measure of rate of change 

fcf P s = ^;. (7.113) 

The coefficient k cor can now be determined as 

kcor = ^erp^fps- (7.114) 

Setting k erp = 1 is not recommended, since various internal approximations can cause the errors not to be 
completely fixed. The Open Dynamics Engine [ODE, 2005] uses the same approach for correcting errors. 
They recommend using a value around 0.8. 


7.7 Joint Types 

In this section we will derive the Jacobians for several different kinds of joint types needed for the kine¬ 
matic constraints explained in the previous sections. 


7.7.1 Ball-in-Socket Joint 

A ball-in-socket joint allows arbitrary rotation between two bodies as illustrated in Figure 7.10. We already 
know that a ball-in-socket joint removes three DOFs, so we conclude that the Jacobian, J bail, for the ball 
is a 3 x 12 matrix. From our previous example in Section 7.5.1 it should not come as a surprise that the 
submatrix of the Jacobian is given by 


J ball — 


Jr i J r J J 

u lin ’ u lm ’ u ang i u a: 


ang 


(7.115) 
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Figure 7.10: A ball-in-socket joint example. 


where 

"l 0 O' 

Jj in = 0 10, (7.116a) 

|_0 0 1 

[-10 O' 

J{ n = 0-10, (7.116b) 

0 0-1 

J Lg = » (7.116c) 

Jln g =(R(qj)rL)\ (7.116d) 

and where J[ in G R 3x3 , .7j n „ G M 3x3 , Jj in G M 3x3 , Jj na G M 3x3 , and the velocity error correcting term, 
&baii G R 3 , is given by 

bbaii = ^cor {rj + R{qj)r{ nc - rj - it(9i)r^ c ) . (7.117) 

7.7.2 Hinge Joint 

A /j/nge joint, also called a revolute joint, only allows relative rotation around a specified joint axis as 
illustrated in Figure 7.11. We describe the joint by an anchor point placed on the axis of rotation and a 
joint axis, given by a unit vector in the world coordinate system. We only have one DOF, meaning 

that the hinge joint places five constraints on the relative movement. Hence the Jacobian, J hi nge is a 5 x 12 
matrix 
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Figure 7.11: A hinge joint example. 


where Jj in € R 5x3 , J ang E R 5x3 , Jj in E R 5x3 , J{ ng E R 5x3 , and fringe G R 5 . A hinge joint has the same 
kind of positional constraints as the ball-in-socket joint, so we can immediately borrow the first three rows 
of the ball-in-socket Jacobian and the error measure, and we only have to extend the hinge Jacobian with 
two more rows, which will constrain the rotational freedom from the ball-in-socket joint to only one axis 
of rotation. 

The strategy for adding the two rotational constraints is as follows: since we only want to allow 
rotations around the joint axis, only the relative angular velocity of the two bodies with respect to the joint 
axis is allowed to be nonzero, that is, 


^axis ' (Wi Wj) 7^ 0. 


(7.119) 


The relative angular velocity with any other axis orthogonal to s ax is must be zero. 

In particular, if we let the two vectors t 1 wcs , t 2 wcs E R 3 be two orthogonal unit vectors, and require 
them to be orthogonal to the joint axis .s, lx * cs , then 


wcs • (u>i — ojj) = 0, (7.120a) 

£ 2 wcs • (wi -Uj) = 0. (7.120b) 


From these two equations we have the two needed kinematic constraints and we can write the hinge 
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Jacobian as follows: 


ji 

J li 


lin 


p 

J lin 


V 

^ ang 


7 J 

^ ang 


1 o o 
0 1 0 
0 0 1 
0 0 0 
0 0 0 


-1 

0 

0 

0 

0 


0 

-1 

0 

0 

0 


0 

0 

-1 

0 

0 


-(R(qi)ri nc )' 
($! WCS ) T 
(t 2 wcs ) T 

{ R (Qj) r L 

- (t x wcs y 

-(*2 WCS ) T 


(7.121a) 


(7.121b) 


(7.121c) 


(7.121d) 


For the error-measure term, we already have the first three error measures from the ball-in-socket joint 
taking care of positional errors. Two further error measures arc needed for rotational misalignment around 
any nonjoint axes. 

If we store the joint axis with respect to both body frames, ,s ax ' is and s ax ] s , then computing the joint 
axis directions in the world coordinate system with respect to each of the incident bodies gives 


(7.122a) 
(7.122b) 

If s- wcs = Sj wcs , then there is obviously no error in the relative hinge orientation between the bodies. If 
there is an error, then the bodies must be rotated such that s i wcs and s ■ wcs arc equal. This can be done as 
follows: imagine the angle between the two vectors is 9 en , then we can fix the relative error by rotation of 
f?err radians around the axis 

U = s t wcs x Sj wcs . (7.123) 

Let’s say that we want to correct the error by the angle 9 cor within the time At, which could be the 
size of the time-step in some time-stepping algorithm, then we would need a relative angular velocity of 


a* wcs = #(%Kxis> 
8™ = 
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magnitude 


^ cor — 


u cor 

A t 

/'crp ^err 

At 
1 

Verp A? 

— /'crp /'fps^crr 

— /ucor^eiT’ 


(7.124a) 

(7.124b) 

(7.124c) 

(7.124d) 

(7.124e) 


The direction of this coiTecting angular velocity is dictated by the w-vector, since 


^cnr — KW 


U 

||u|| 

u 


|li|| 

u 


- /''COfl/ciT j 

— kcor@evr~. 7 • 

Sill c/grr 

In the last step we used that s i wcs and s- wcs are unit vectors, such that 


(7.125a) 

(7.125b) 

(7.125c) 


||u|| = ||s* wcs x Sj wcs || = sin 9 err . (7.126) 

We expect the error to be small, so it is reasonable to use the small angle approximation, where 8 en ~ 
sin# err , i.e., 

^■Wor = k cor u. (7.127) 


We know that u is orthogonal to s ax ^ cs , so we project it onto the vectors t { wcs and t 2 wcs , and we end up 
with the error measure 


-'hinge 


= k c 


(rj + R(qj)r{ nc - ri - R(qi)r l anc ^j 

t, wcs • u 

t 2 wcs • u 


(7.128) 


7.7.3 Slider Joint 

The slider joint only allows translation in a single direction, as shown in Figure 7.12. Hence there is only 
one DOF, so the Jacobian of the slider joint, J s iider- must be a 5 x 12 matrix 


slider 





(7.129) 


where Jj in £ M 5x3 , J* ng £ M 5x3 , Jj- m £ M 5x3 , and j{ Dg £ M 5x3 . We will use the first three rows of the 
Jacobian to ensure that the two bodies connected by the slider joint do not rotate relative to each other, 
hence we require that they have identical angular velocity. 
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Figure 7.12: A slider joint example. 


The last two rows of the Jacobian is used to make sure that the bodies only move relatively in the 
direction of the joint axis, .s ax * cs . This is done as follows: first we note the following relation between the 
bodies’ linear velocities 

Vj = Vi + U>i X C + V slider j (7.130) 

where c = rj — r t , and lender is the joint velocity along the slider axis. Recalling that ui j = u> 3 , we can 
rewrite the velocity relation as follows: 


Vj — Vi T" LV, X C '^slider) 
^slider = ~ Vj + CJj X C, 

(jJi OJ j 

“^slider = Vi~ Vj 4-—-- X C. 


(7.131a) 

(7.131b) 

(7.131c) 


From the joint axis, s ax * cs , we can compute two orthogonal vectors t 1 wcs and t 2 wcs . By the workings of 
a slider joint we know that we may never have any relative velocities in the directions of the two vectors 
wcs and t 2 wcs . That is, 


0 = i, wcs -(— raider), (7.132a) 

0 = ^2 WCS ‘ ( — ^stider) • (7.132b) 


From these two equations we can derive the remaining two rows in the Jacobian slider matrix, and we find 
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T J - 

•Min — 


0 

0 

0 

0 

0 

0 

0 

0 

0 

(*1 

wcs 

) T 

.(^2 

wcs 

) T 

' 0 

0 

0 

0 

0 

0 

0 

0 

0 


p = 

ang 


7 J = 

ang 


_( ti w cs)J 

-(*2 WCS f. 

10 0 ' 

0 1 0 
0 0 1 
ic x 

i wcs 

2 L. A 1/2 

-10 0 
0-10 
0 0-1 

±C X t [ wcs 
I/O V + wcs 

2 /n i/2 


(7.133a) 


(7.133b) 


(7.133c) 


(7.133d) 


Now we will look at the error term, 6 s iider £ R 5 . The first tree entries arc used for rotational misalign¬ 
ment between the two links, such as sliding along a bend axis. The last two arc used for fixing parallel 
positional displacement of the joint axis. 

As for the hinge joint, we derived an angular velocity to correct the misalignment error of 9 en radians. 
The magnitude of this correcting angular velocity is as before 

IKor|| = % (7.134a) 


/'erp f^err 


(7.134b) 


— ^erp u err 

— Z'erp A-fps^err 

— Arort/err* 


(7.134c) 

(7.134d) 

(7.134e) 


As before, the direction of this correcting angular velocity is dictated by an rotation axis given by some 
unit //-vector 


k^cor — H^corll ^ 

— kcar^errU. 


(7.135a) 

(7.135b) 
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However, unlike previously, the correcting angular - velocity will be derived as follows: let the rotational 
misalignment be given by the quaternion, q en , then we have 


Qen = [s,v], 



The error is suspected to be small, so the small angle approximation is reasonable and we find 



Using this in our formula for the correcting angular velocity, we get 

k^cor = kco^V. 


(7.136a) 

(7.136b) 


(7.137) 


(7.138) 


This will be the first three entries in the 6 s / i( fe r -vcctoi'. 

We can describe the current joint position by an offset vector, r of j vcs , which indicates the initial differ¬ 
ence between the body centers, that is, 


r off = R (Qj) T ( r J - r i) ■ (7.139) 

Observe this offset vector is computed when the joint initially was set up, that is, before simulation, and it 
is a constant. The corresponding offset in the world coordinate system is then simply found as 

U,fT s = R(qj)rJ ■ (7-140) 


If there is no parallel displacement of the joint axis, then the vector, c — r n| | vcs will have no components 
orthogonal to the joint axis. From this observation we have the last two entries in the vector 6 s iider> 


^slider — k c 



2v 


f wcs 

f wcs 

_ 2 

1 1 

r wcs\ 
r off ) 
r> wcs\ 
r off ) A 


(7.141) 


7.7.4 Hinge-2 Joint 

The hinge-2 joint is also called a wheel joint because its motion resembles that of a turning front wheel on 
a car. Therefore, we will explain the workings of this joint type by the example of a car wheel as shown 
in Figure 7.13. 

The wheel joint is the same as a series of two hinge joints. Its motion is described by a rotation axis, 
S axisi’ given by a unit vector in the body frame of body i, and another rotation axis, sj S2 , given as a unit 
vector in the body frame of body j. 

In the following, we will implicitly assume that body i is the car and body j is the wheel. Using this 
convention, the axes are referred to as the steering axis or suspension axis and the motor axis. 




Figure 7.13: A car wheel joint example. 


We will also use an anchor point like before, where both axes arc going through this anchor point. We 
will assume that the axes do not lie along the same line, and that they arc always separated by the initial 
angle, 9, between them. 

From the description, it is clear that the joint has two DOFs, from which we know that the wheel joint 
Jacobian must have dimension 4x12 

Jwheel = [JL J\ a g J] m jQ , (7-142) 

where J] in 6 M 4x3 , J* ng g K 4x3 , j{ B G M 4x3 , and j{ ng g K 4x3 . 

Following the same recipe as previously, we reuse the ball-in-socket joint for the positional constraints 
and we arc now left only with the fourth row in the Jacobian matrix. 

Let’s compute the joint axis in the world coordinate system 

s, wcs = R( qi )sJ e , (7.143a) 

a i wcs = «(<&>«£, (7.143b) 


then the constrained rotational DOF is dictated by a rotational axis orthogonal to the two rotation axes, 
that is, 
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For the hinge to keep its alignment, we must ensure that there is no relative rotation around this axis 

u • u>i — u • ujj = 0. (7.145) 

This give us the missing fourth row of the Jacobian matrix 


r 


lin 





1 0 0 
0 1 0 
0 0 1 ’ 

0 0 0 

'-10 o' 
0-10 
0 0 -1 ’ 

0 0 0 

(^(9i) r L) X 

T ’ 

u 

. ( R (<ij) r U x 

T 

u 


(7.146a) 


(7.146b) 


(7.146c) 

(7.146d) 


From the ball-in-socket joint we also have the first three entries of the error term, b w heei £ K 4 - Therefore, 
we only need to come up with the fourth entry to reestablish the angle 6 between the two joint axes. Let’s 
say that the current angle is given by (f>, then we need a correcting angular velocity of magnitude 


II II _ ^cor 

ll^corll - 

_ ^erp (G — 0) 

~ At 

= k ei p-^ (6 — <fi) 

— /Cerpfcfps (0 0) 

= k c o r (0 0) . 

We can now write the error-term vector as 


^wheel — k c , 


-'ball 


W-t) 


(7.147a) 

(7.147b) 

(7.147c) 

(7.147d) 

(7.147e) 


(7.148) 


Finally, two more tricks arc possible: first, one rotates the axes of the ball-in-socket joints, such that the 
first constraining axis is along the suspension axis, which allows one to model suspension by modulating 
the translational error in the ball-in-socket joint along its first axis. Second, a small angle approximation 
for the fourth entry in the error-term vector may be used [ODE, 2005]. 
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Figure 7.14: A universal joint example. 


7.7.5 Universal Joint 

The universal joint is in some sense si mi lar to the wheel joint, and is described similarly by the two joint 
axes 


(7.149a) 

(7.149b) 


and an anchor point, which the two axes run through. The difference from the wheel joint is that it is 
further required that axis two makes an angle of 7r/2 with axis one. An example of a universal joint is 
shown in Figure 7.14. 

We notice again that we have two DOFs, and the Jacobian matrix of the universal joint, J universal* must 
be a 4 x 12 matrix 


•^universal 



/■? 

*Min 



(7.150) 


Since this joint type has derivations almost identical to previous types, we will ease on notation and go 
through the steps faster. We start out by reusing the ball-in-socket joint for the positional constraints, and 
then we compute the constrained rotation axis 


U = 8i WCS X 8j WCS , 


(7.151) 


along which we know there must be no relative angular velocity. We can now write the Jacobian matrix 
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for the universal joint as 


r 


lin 


p 

" lin 


p 

ang 


JJ 

ang 


1 0 0 

0 1 0 

0 0 1 

0 0 0 

-1 0 

0 -1 

0 0 

0 0 




0 

0 

-1 

0 

X - 


u 


(R(Vj)rL)' 

,T 


u 


We already have the first three entries of the vector, 6 universal, and we 
first looking at the magnitude of the correcting angular velocity 


K-t-V.nr — 


^cor 

A t 

k e rp (4> - f) 


At 


= kr 


(7.152a) 

(7.152b) 

(7.152c) 

(7.152d) 

must find the fourth. We do this by 

(7.153a) 

(7.153b) 

(7.153c) 


where 0 denotes the current angle between the two joint axis. If 0 is close to it/ 2, then 


7T 


0 - o ~ COS ( (j)) 


_ wcs wcs 

- ■ Sj . 


We can now write the error-term vector as 

^universal — ^cor 


Oball 

_ c wcs c wcs 

s i ' s j 


(7.154a) 

(7.154b) 


(7.155) 


7.7.6 Fixed Joint 

For fixed joints, we know that it constrains two bodies completely from any relative movement, and it 
therefore has zero DOFs, from which we know that the Jacobian matrix, Jfi xe d F M 6x12 


^universal 


P P 

u lin u ang 


T J 

" lin 



(7.156) 
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The fixed joint is described by an anchor point, and initially we compute an offset vector and store it in 
the body frame of body i 

r o b = r i~ r j- (7.157) 

Observe this offset vector is computed when the joint initially was set up prior to simulation, and it is a 
constant. The corresponding offset in the world coordinate system is then found by 

r o r = R(qi)r 0 l (7.158) 

Since we have a fixed joint, both incident bodies must be rotating with the same angular velocity and the 
lineal - velocities must obey the relation 


vj = Vi + Ui x cs . 


From all this we can now set up the Jacobian matrix as 


r 


lin 


p 

‘Min 


p 

17 ang 


/3 

17 ang 


1 o o 
0 1 0 
0 0 1 
0 0 0 
0 0 0 
0 0 0 


-1 

0 

0 

0 

0 

0 


0 

-1 

0 

0 

0 

0 


0 

0 

-1 

0 

0 

0 


_ (r, wcs\ : 
('off > 


1 

0 

0 

■ 0 
0 
0 

-1 

0 

0 


0 

1 

0 


0 

0 

0 

0 

-1 

0 


0 

0 

0 

0 

0 

-1 


(7.159) 


(7.160a) 


(7.160b) 


(7.160c) 


(7.160d) 


Similarly, the error term is straightforward, since most of it was presented for the ball-in-socket joint, b bail, 
from where we get the first three entries, for positional error, and from the slider joint, s iidei - We may 
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reuse the first three entries to take care of any rotational errors. We can now write the bfi xe d vector as 


frfixed — k c , 


Ti + R{qi)rJ c Tj R{qj)rJ c 


anc J 

2v 


where v comes from the quaternion representing the rotational error 

<?err = [s,«] • 


(7.161) 


(7.162) 


7.7.7 Contact Point 


As we have explained, contact constraints arc completely different from joint constraints, but they too, arc 
described by a Jacobian matrix. As we will show, this Jacobian matrix J con tact> can be expressed in the 
same submatrix pattern as the joint Jacobians; one can even construct an error correcting term. 

From previously, we know that the contact Jacobian has 1 + rj constraints, so it is (1 + ?/)-by-12 
dimensional matrix, 


' contact 


V 

•J u 


tin 


ang 


p 

'Min 


JJ 

’ ang 


(7.163) 


The first row corresponds to the normal force constraints and the remaining // rows correspond to the 
tangential friction constraints , that is, 





- ( ri x n) T 

- ( ri x D k f 

0 rj x nf 
(rj x D k f ' 


(7.164a) 

(7.164b) 

(7.164c) 

(7.164d) 


If the penetration constraints arc violated, then an error correcting vector, b C ontact £ M l+f? , can be used as 


= k c , 


^penetration 

o 


(7.165) 


where ^penetration is the penetration depth. These observations regarding the submatrix patterns of the 
Jacobians of both the contact and joint constraints allow us to implement these kind of constraints using 
almost the same kind of data structure. 
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7.8 Joint Limits 

It is not always enough just to set up a joint, even though the relative motion is constrained to move only 
in a consistent manner w.r.t. the joint, other constraints need our attention. For instance, in the real world 
we cannot find a jointed mechanism with a sliding joint that has an infinitely long joint axis. In other 
words, we need some way to model the extent of a sliding joint. We will do this modeling by setting up 
joint limits. To be specific, we will treat joint limits on a sliding joint and a hinge joint. 

The general approach we will take here for introducing joint limits is very similar to the way we use 
unilateral constraints for enforcing normal nonpenetration constraints at the contact points. In a sense, 
setting joint limits this way is nothing more than a slightly exotic way of computing contact points for 
normal force constraints disguised as joint limits. 

7.8.1 Slider Joint Limits 

Recall that in specifying the slider joint we used an offset vector, which was the initial difference between 
the origin of the body frames: that is, at time t = 0, we compute 

r 0ff = ( r j - r i) • (7.166) 

As before, the initial offset w.r.t. the bodies current location in the world coordinate system is computed 
as 

r 0 r = (7-167) 

Now letting c = rj — r. t , we can compute the current displacement, rdis, along the joint axis as 

r-dis = c-r off wcs . (7.168) 

Taking the dot product with the joint axis, s ax " cs , gives a signed distance measure of the displacement 
along the joint axis, 

ddis = «axe CS ' r dis- (7.169) 

When the joint limits are imposed on the slider joint, we want to be able to specify a lower-distance limit, 
d[ 0 , and an upper-distance limit, <7 hi- If one of these arc violated, for instance, 

ddis < die, (7.170) 

then we will add a new unilateral constraint. This constraint specifies that the relative velocity of the joint 
along the joint axis must be such that the displacement does not move beyond the limit. For a slider joint 
that means we require 

*axe CS • (Vj - Vi) > 0. (7.171) 

We immediately see that this is equivalent to a 1 x 12 dimensional Jacobian matrix, J sl y er , where 


ji ( a wes \T 

u lin l*axe ) > 

(7.172a) 

jj _ (a WCS \T 
u lin V*axe ) i 

(7.172b) 

p =0 

J ang 

(7.172c) 

J j = 0 

* ang w • 

(7.172d) 
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However, this would be the wrong Jacobian to use because when we look at the reaction forces from 
the joint limit, this Jacobian will not model the torques coming from the limiting force, only the linear 
contribution is included. 

We will now remodel the Jacobian to include all the force and torque contributions. Let’s say that 
the position of the joint limit is given by the vectors: r h ™ cs and r n " cs . These vectors are specified in the 
world coordinate system and are running from the respective body centers to the position of the joint limit. 

Now say that some force, F, is acting on body i at the joint limit, then the force is required to be 
parallel with the joint axis .s ax " cs . The force contribution to body j is — F according to Newton’s third law 
of motion. The limit force also result in a torque on body i 

-Him. = r lim w “ x F, (7.173) 

and a torque on body j, 

'Him, = -rqjf x F. (7.174) 

For a slider joint we must also require that these torques do not induce a relative angular velocity of the 
two bodies, meaning that angular momentum should remain unchanged. From Euler’s equation, we get 


Tlim, + Tum. = 0. (7.175) 

Recalling that the corresponding reaction force is given by 

^slider = («/slider) T Ato, (7-176) 

where Ai 0 is a nonnegative Lagrange multiplier. This suggest that the Jacobian should look like 


ji wcs\T 

° tin V*axe I 5 

jj _ wcs \T 
u lin — V d axe ) ? 

ji — ( r wcs v wcsW’ 

u ang \ # linii ^ °axe / : 


JLo = - [r 


ang 


lim 7 * 


X 


(7.177a) 

(7.177b) 

(7.177c) 

(7.177d) 


However, it is not obvious that J! in „co ( + Jj n „LO y = 0, also to avoid computing the vectors TqWj 
r iim CS during simulation it would be nice to remove them from the expressions. 


and 


Observe that the vector, c = r ? - — r*j, can be written as, c = r i; " cs — r ,W CS . We will now show that 


lim 7 * 


with the c-vector we can obtain: 


lim 7 - 


linio 


xF=LxF = i(r„~-r ll ”) x F. 


xF = -^cxF = --(r 1 W cs -r, : 


lim 7 


x F. 


(7.178a) 

(7.178b) 


From the second equation we have 


lim 


WCS v 171 _ _ ^ WCSy p 1 „ wcs jp 

x * ~ - r iim i x ^ + -r lim . X t , 


2 ’ lim J 


(7.179) 
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which yields 

fi”xF = —(7.180) 
and substituting this into the first equation yields 

i-ltof xF-i (r,,” - r„“) X F (7.181a) 

= x F - x F (7.181b) 

= x F - X f) , (7.181c) 

J-ltof x F - r, x F. (7.18Id) 


This proves (7.178a). Repeating the steps but interchanging equations easily derives (7.178b). Observe 
also that the sum of the two equations is equal to zero as required by Euler’s equation. We can now rewrite 
the angular parts of the Jacobian as 

4n = -« CS ) T i (7-182a) 

A n = (^axD T , (7.182b) 

4)g = \ (c X s™) T , (7.182c) 

4 g = -^(cxCf. (7.182d) 

To verify that J' l wg Ui + = 0, we insert (7.182c) and (7.182d) and find 

\ ( c X S axe CS ) T - \ (c x Saxe CS ) T ^i = 0, (7.183a) 

=► i(cxCfK-Wj) = 0, (7.183b) 

and because we have a slider joint u>j = In conclusion, we see that with the Jacobian in (7.183), both 
the kinematic constraints arc satisfied, and the reaction forces are prober. 

Putting it all together, we have the complementarity constraint 

*7 s iider u > compl. to Ai 0 > 0. (7.184) 

An error correcting term, & sl y^. is easily added to the right side of the kinematic constraint as 


u 1° — u 
''slider 


dlo ddi f 


erp 


At 


— /'cordcrr - 


(7.185a) 

(7.185b) 


where we have set d en = d\ Q — da s . 
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In conclusion, we have derived a single linear complementarity constraint for the lower joint limit of 
a slider joint. The same approach can be used to derive a single linear complementarity constraint for the 
upper limit. It should be apparent that all that is really needed is to negate the Jacobian, that is, 


and the error term 


h hi — u 
'*'1 


— T lo 
u slider’ 

(7.186) 

7 ^hi ^dis 

k “■> At 

(7.187a) 

^cor^err • 

(7.187b) 


7.8.2 Hinge Joint Limits 

There really is not much difference between setting limits on a slider joint and a hinge joint. The major 
difference is that the joint axis now describes a rotation axis, and instead of distances, we use angle 
measures. 

If we store the initial relative rotation of two bodies in the quaternion, q m \, then we can compute the 
current relative rotation of the two bodies as 


fei = (7.188) 

This quaternion corresponds to a rotation of 6 radians around the unit axis, v, that is the angle measured 
from i to j around the joint axis, 


9rel — 



(7.189) 


Using standard trigonometry and taking care of the double representation of rotations by picking the 
smallest angle solution, one can extract the angle from the quaternion of relative rotation. Specifically, 
taking the dot product with the vector paid of the quaternion, sin |, is obtained since v v = 1, now arctan 
can be used to obtain |. 

As we did in the case of the slider joint, we want to impose a low- and high-joint limit, 6\ 0 and 0^. If, 
for instance, the lower limit is violated as 

9 < 9 lo , (7.190) 


then we will add a new unilateral constraint, which specifies that the relative angular velocity of the joint 
around the joint axis must be such that the angle does not move beyond the limit. For a hinge joint this 
means that we must require 

a,- 8 • ( Uj - w< ) > 0. (7.191) 

We immediately see that this is equivalent to a 1 x 12 dimensional Jacobian matrix, where 


©in ~ 0, 

(7.192a) 

4n = 0, 

(7.192b) 

ji _ _ r wcs\T 

J ang V^axe ) ’ 

(7.192c) 

jj — wes \T 

J ang V^axe ) 

(7.192d) 
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The corresponding reaction force from the joint limit is given by 

^4= (^hinge) T A l0 , (7-193) 

where Ai 0 is a nonnegative Lagrange multiplier. We see that we have the complementarity constraint 

Jhinge M — 0’ compl. to Aio > 0. (7.194) 

An error correcting term, & hi ],° e , is added to the right side of the kinematic constraint as 

Oge = (7.195a) 

= k cor 9 en , (7.195b) 


where 9 erv = 9\ 0 — 9. 

The constraints for the high limit of the hinge joint is obtained by negating the Jacobian 

T hi T 1° 

** hinge ** hinge i 


and the error term is given by. 


J, hi - - 
^hinge 


9 hi - 9 
' erp At 


— kcar9e 


(7.196) 


(7.197a) 

(7.197b) 


7.8.3 Generalization of Joint Limits 

In the previous sections we derived constraint equations specific for low- and high-joint limits on slider and 
hinge joints. Fortunately, it is possible to extend these ideas to a more general framework. For instance, 
the concept of a reach cone, i.e., multiple angular joint limits, is often used in biomechanics to describe 
the limited movement of the shoulder or hip joints in the human skeleton [Wilhelms et al., 2001], 

In fact, we could formulate the allowable configuration space or the reachable region for a joint, by an 
implicit function, C{...) £l, of the joint parameters. For the slider and hinge joints, the joint parameters 
are the displacement and the angle, which we will specify with the generalized joint parameter vector, q, 
as a function of the generalized position vector, s. Furthermore, the implicit function has the following 
characteristics, 


C(q(s)) < 0 Outside, (7.198a) 

C(q(s)) = 0 On boundary, (7.198b) 

C(q(s)) > 0 Inside. (7.198c) 

We can now reformulate positional constraints as 

C{q{s))> 0. (7.199) 
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Differentiation w.r.t. time leads to the kinematic constraint. 


s C( «< s)) 

> 0 , 


dC(q(s)) dq ds 
dt 



which could be augmented with an error reduction term 

J C u> b c . 


The reaction forces arc determined by 


T^C _ 

r reaction 


jT 

J C 


*c, 


(7.200a) 

(7.200b) 

(7.200c) 


(7.201) 


(7.202) 


where Ac is a vector of nonnegative Lagrange multipliers. Finally, we have the complementarity con¬ 
straints, 

Jcu — be > 0, compl. to Ac > 0. (7.203) 

The constraints in terms of the Jacobian and the error term should be added to the system equations, 
whenever a joint limit has been violated or its boundary has been reached. This is completely analogous 
to collision detection; enforcing joint limits in this manner is therefore not much different from finding 
contacts and computing normal forces. 

Observe that in a force-based formulation, the kinematic joint limit constraints should be differentiated 
w.r.t. time to get the acceleration constraints needed for augmenting the system equations. 


7.9 Joint Motors 

With joints and joint limits, we are capable of modeling the range of relative motion between two bodies. 
We will now look at one way to control the motion that is taking place. 

A joint motor applies torque or force to a joint’s degrees of freedom to induce movement. The joint 
motor model uses two parameters for this: a desired speed, ndesired^ and the maximum torque or force, 
A m ax. that can be applied to reach the desired speed. 

From past sections we have seen that the error correcting term can be used to adjust velocities. The 
same principle can be used to drive a joint toward a desired speed by 


Ti p p 

u tin u ang J lin 


J motor^ A b 


motor i 


p 

J ang 




U> ; 


> b 


_ ‘'motor ■ 


(7.204a) 


(7.204b) 
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For a one DOF joint like a slider and a hinge joint, the motor Jacobian will have dimension 1 x 12, and 
the right-hand side will be a scalar. In fact. 


bmotor — ^desired- 


; Jacobian is also easy to derive for these two cases 


r slider _ r wcs p _ wcs 

J motor L^axe > u ’ *axe 

T hin S e _ [n _ O wcs r. wcsi 
u motor °axe > ‘’axe J 


So], 


(7.205) 


(7.206a) 

(7.206b) 


The motor force is given by the relation 


? _ j i \ 

motor — u motor^motor? 


(7.207) 


where A m0t0 r is a Lagrange multiplier, that can be interpreted as a measure of the magnitude of the joint 
force along the degrees of freedom in the joint. By setting upper and lower limits on A motor, we can model 
the aspect of the maximum available force, that is, 

— Amax L A mo tor L A max . (7.208) 


Finally, we require the force and the desired velocity term to be complementary to each other, that is, 

J motor W T 6 motor , COmpl. to | Amotor | — Amax* (7.209) 

The basic idea behind this is that if the velocity exceeds the desired velocity, then the motor force should 
work at its maximum to bring the velocity back to the desired speed. 

On the other hand, if the desired speed has been reached, then the motor force can assume any value 
between 0 and |A max | to keep the desired speed. 

In conclusion, we have developed linear complementarity constraints with both upper and lower limits 
for joint motors. The theory we have outlined can be extended to more complex joint types in a straight¬ 
forward manner, simply by formulating Jacobians for their degree of freedom movement. 

Another aspect of joint motors, which comes in handy, is that they can be used to model friction in 
joints. This is done by setting the desired velocity to zero and the maximum force to some constant value; 
then all joint motion will be slowed down by th e frictional motor force. 

To drive the joints to a specified position, a positional joint motor would simply be a hybrid of the 
joint limit model and the joint motor models we have outlined. The trick lies in setting the lower- and 
upper-joint limits equal to the wanted position, and then limiting the motor force as we did in this section. 

In contrast to real-life motors and engines, the presented motor controls capture the essential idea 
of limited power and the need for controlling the speed. Higher-level controllers could be added to a 
simulator (motor programs) for manipulating the joint motors, but this is out of the scope of this book. 
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Algorithm fixed-time-step (s 4 ,u 4 , At) 
s ' = s 4 + A tSu 4 
A = LCP(s ',u 4 ) 

u 4+A4 = u 4 + M - 1 (J T A + Af/ ext ) 

« 4+At = « 4 + AtSw 4+A4 
return s t+At 
End algorithm 


Figure 7.15: Fixed Time-Stepping. 


7.10 Time-Stepping Methods 

In order to calculate the movement of rigid bodies, the simulation loop needs to advance the simulation 
time. This process is called a time-stepping method or time control. Knowing how to compute contact 
and constraint forces or impulses at collisions, the time-stepping method sets up a scheme that integrates 
the forces in order to obtain the motion of the rigid bodies. In the following, we will discuss mainly fixed 
time-stepping methods. 

The matrices M, C , N, and D depend on the generalized position vector s, which itself is dependent 
on time. 

In some simulators the time dependency is ignored and the simulator uses a fixed time-stepping rou¬ 
tine, such as is the case of the Open Dynamics Engine [ODE, 2005]. The general idea is illustrated in Fig¬ 
ure 7.15, and the main advantage is that only a single LCP problem is solved per time-step. Unfortunately, 
it also leads to penetrations and drifting problems. In the Open Dynamics Engine several heuristics are 
used to overcome these problems: constraint Force Mixing, Error Reduction Parameter, and a specialized 
position update. These heuristics will be discussed in this section. 

Now we will discuss the detection of contacts. There are several ways to approach this for the pseu¬ 
docode in Figure 7.15. One way is to invoke the collision detection at time t and hope for the best, but 
new contacts might develop during the time from t to t + At. The future contacts that are overlooked at 
time t could potentially end up as violated contacts at time t + At. That is, deeply penetrating contacts. 
This is illustrated in Figure 7.16. 

Stewart and Trinkle [Steward et al., 1996] propose a retroactive detection approach to detecting over¬ 
looked future contacts. First, all contacts are detected at time t, then a fixed time-step is taken, and finally 
all contacts at time t + At are detected. If any new and violated contacts are found, then these contacts 
are added to the set of contacts found at time t, the simulation is then rewound to time t, and a new fixed 
time-step is taken. These steps are repeated until all the necessary contacts have been found. Pseudocode 
can be found in Figure 7.17. Although it is obvious that only a finite number of contacts exist, and that the 
algorithm sooner or later will have detected all contacts necessary for preventing penetration, it is, how¬ 
ever, not obvious how many iterations the algorithm will take, and it is therefore difficult to say anything 
useful about the time-complexity. The retroactive detection of contacts is similar to the contact tracking 
algorithm described in Section 14.3. 

Detecting a future contact at an early stage can have an undesirable effect, in essence, a contact that 
appeal's at time t + At is resolved at time f. This would make objects appeal' to have a force field or 
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time t 


time t+dt 




Figure 7.16: 2D illustration of the problem with violated contacts in a fixed time-stepping due to over¬ 
looking potential future contact. Small white circles show valid contacts and black small circles show 
violated contacts. Notice that contacts not detected at time t will be detected as violated contacts at time 
t + dt. 


Algorithm retroactive-1 (s t ,u *, At) 
repeat 

Scontacts = collision detection(s { ) 
s t+At = fixed-time-step(s t ,u t , At) 

^contacts = collision detection(s t+At ) 
if violated contacts in ^contacts then 
for all contacts c £ S C o ntacts do 
if c violated and c ^ S CO ntacts then 
add C to contacts 
next c 
else 

return s t+At 
end if 

until forever 
End algorithm 


Figure 7.17: Retroactive Detection of Contacts in a Fixed Time-Stepping Method. 
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Figure 7.18: 2D example showing visual artifacts of resolving future contacts at time t. The elastic ball 
never touches the fixed box before it bounces; the inelastic ball is hanging in the air after its impact. 



Figure 7.19: System moving along a concave boundary of the admissible region in configuration space. 


envelope surrounding them, preventing other objects from actually touching them. Most noticeable, this 
could cause an object hanging in the air, as shown in Figure 7.18. If we arc simulating fast moving and 
bouncing objects, it is unlikely that an observer would notice the visual artifacts, but if the time-step is 
big enough, the artifacts illustrated in Figure 7.18 may annoy an observer. To remedy this effect, one 
could either lower the time-step size, which causes a performance degradation, or one could remodel the 
error terms to take into account that a contact does not exist at time t. This approach was elaborated on in 
Section 7.3. 

Time-stepping methods may result in penetrations when the system moves along concave boundaries 
of the admissible region in configuration space [Anitescu et al., 1997, Stewart, 2000]. Configuration space 
is the vector space consisting of the concatenation of the generalized position and velocity vectors. The 
admissible region is the subspace of states that make physical sense, like states that do not cause a penetra¬ 
tion. A simple example is illustrated in Figure 7.19. As can be seen in the figure, the dotted line indicates 
the normal constraint; therefore the box is allowed to move along this line. Furthermore, since the lower 
fixed object is concave at the touching contact point, a penetration will always occur no matter how small 
a step the box moves along the dotted line. This indicates that situations will arise where all necessary 
contacts have been detected, but still the algorithm will find violated contacts causing an infinite loop 
since the (/’-statement in the pseudocode of Figure 7.17 always will be true. This seems to be an unsolved 
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Figure 7.20: Different ways to project a violated contact back into a valid contact. The horizontal pro¬ 
jection does not change the potential energy of the box; both the vertical and the inclined projections 
increase the potential energy. In the following time-step, the added potential energy is transformed into 
kinetic energy, resulting in a more violent collision between the two objects. 


problem in the literature. 

Stewart and Trinkle [Steward, 2000] suggest that simple projection can be used to eliminate the prob¬ 
lem of penetrations as a result of moving along concave boundaries in configuration space. They state, 
however, that care must be taken to avoid losing energy in the process. In [Baraff, 1995] a displacement is 
computed by using a first-order system Af = b, where A only contains normal and bilateral constraints. 
In our notation, this corresponds to 

. _ r N t C t M~ 1 CN N t C t M~ 1 J^ 1 

~ [ J%M- l CN JlM- 1 J ' ( ' 0) 

The b-vector contains the signed constraint errors. From the solution, /, the derivative, s, can be computed 
as described in Section 7.14. Now using As = s, a displacement can be obtained as s t+At = s t+At + As 
(see [ Baraff, 1995] for more details). 

Solving penetration errors by projection can change the potential energy of the system, thus changing 
the total energy in the system. Figure 7.20 illustrates the energy problem with the projection method. In 
the figure, three possible projections of the box arc shown as dashed boxes. Due to the gravitational field 
g, both the vertical and the inclined projections will increase the energy of the system, while only the 
horizontal projection will maintain the same energy level. If the geometries arc mirrored in the vertical 
direction, then the vertical and inclined projections will result in a loss of energy. In other words, projec¬ 
tion must be done in such a way that the potential energy remains unchanged. It is not obvious to us how 
a general method of projection can be designed to fulfill this. 

Another approach for handling penetration errors is to use constraint stabilization. This can intuitively 
be explained as inserting small damped virtual springs between objects at those places where penetration 
errors arc detected. In Section 7.6 and 7.7, the constraint-based method was extended with error correcting 
terms that stabilized the constraints. An error reduction parameter was used to control the amount of error 
correction. In case of large errors, or over-eager error reduction, stabilization can cause severe alterations 
of the simulated motion because the stabilization is accelerating objects apart in order to fix the error. 
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Figure 7.21: Example of constraint stabilization on a violated contact of a resting box causing the box to 
bounce off the resting surface. 

Thus the stabilization adds kinetic energy to the system; in severe cases stabilization can cause a shock¬ 
like effect on constraints with large errors. The artifact of stabilization is illustrated in Figure 7.21. Several 
approaches exist for minimizing the problem, but none of them completely removes it. By lowering the 
time-step size, the magnitude of constraint errors would also be lowered, thus decreasing the chance of 
large constraint errors and degrading performance. Lowering the error reduction parameter minimizes 
the chance of stabilization being too eager; the drawback is that error correction will take longer, thus 
increasing the chance of an observer noticing deep penetrations of objects. Finally, constraints can be 
made softer by using Constraint Force Mixing as described in Section 7.13. This has the same drawbacks 
as lowering the error reduction parameter; furthermore, it removes kinetic energy from the system. Thus 
if used too aggressively, objects appeal - lazier than they should. 

The matrices M, C, N, and D depend on time; the constraint-based method should rightly be formu¬ 
lated as a nonlinear complementarity problem (NCP). A NCP can be solved iteratively by using a fix-point 
algorithm as shown in Figure 7.22. 

In [Anitescu et al., 1997] an explicit time-stepping method is used together with retroactive detection 
of collisions. We have outlined the general control flow in Figure 7.23. The main idea is to halt the 
simulation at the time of an impact and then handle the impact before proceeding with the simulation. In 
[Anitescu et al., 1997], two LCP problems are set up and used to solve for postvelocities of the impact. 
This is done using Poisson’s hypotheses for handling the compression and decompression phases of an 
impact. In [Baraff, 1989], a simpler approach is used based on Newton’s impact law. This leads to nearly 
the same kind of LCP problem we have outlined. Later in [Anitescu et al., 2002], an even simpler collision 
model is used, which only supports completely inelastic collisions. The methods in [Anitescu et al., 1997, 
Baraff, 1989, Anitescu et al., 2002] are termed simultaneous collision methods; we will not treat these 
further. Another possibility is to use a so-called sequential collision method, which is treated in more 
detail in Section 6.2.4. 
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Algorithm f ixpoint (s *, u At) 
s ' = s t + A tSu 4 
repeat 

A = LCP(s ',u *) 

u' = u t + M^(J T \ + Atf^ 

s " = s ' 

s ' = s t + A tSu ' 
until |s ' — s "\ < £flx 
0 t+At = s ' 
return s t+At 
End algorithm 


Figure 7.22: Fix-point-iteration algorithm. Typical values according to [Sauer et al., 1998] arc At < 10 3 

and €f ix < 10 -4 . 


Algorithm retroactive-2 (s l ,u 4 , At) 


h — At/Usteps 
t = 0 

while t < At do 
s " = s ' + hSu ' 

A = LCP(s ",u') 
if no collision then 
t = t + h 
s ' = s " 

u' = u t + M ' 1 (( J t A + A t/ ext ) 
else 

let ttoi be time of impact 
apply collision model 


t — ttoi 

end if 
end while 
a t+At = a ' 
return s t+At 
End algorithm 


Figure 7.23: Explicit time-stepping with Retroactive Detection of Colliding Contacts. 
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dt = T_target - T 
do 

simulate-forward(dt) 
if intersection 

rewind-simulation() 
dt = dt/2 
else if contact 

collision-resolving() 
T = T + dt 
dt = T_target - T 
end if 

while T<T_target 


Figure 7.24: Example of retroactive advancement based on a bisection root search algorithm. 


In [Anitescu et al., 2002] the same control flow is used in an implicit-time-stepping method, the major 
difference being the way the LCP is formed. We have chosen to omit the implicit version, since it is out 
of scope for the kind of applications we have in mind. 

7.10.1 Numerical Issues with Retroactive Time Control 

Due to numerical truncation and roundoff errors, thresholding is a necessary evil in order to determine the 
type of contact between two objects, A and B. Using a threshold value s, an often-used rule of thumb for 
determining the contact state is as follows: 

• if A and B arc separated by e, then there is no contact 

• if distance between A and B is less than e, then there is contact 

• if penetration between A and B is no more than e, then there is contact 

• if penetration between A and B is more than e, then A and B arc intersecting 

Now we'll review a typical retroactive advancement of the simulation. That is, if time control is handled in 
a root searching manner, the general idea is to watch out for penetrations and then backtrack the simulation 
to the time of impact when they occur. This is illustrated in Figure 7.24. Of course, more elaborate and 
intelligent root search schemes can be used, but the bisection scheme in Figure 7.24 suffices for our 
discussion in this section. 

Suppose A and B arc in resting contact and penetrate each other with a penetration of e, and imagine 
that during the forward simulation a small numerical drift causes A and B to interpenetrate with a depth 
greater than e. As can be seen from the pseudocode, the root searching algorithm will be fooled into 
thinking that a collision has occurred, even though no collision has occurred. Consequently, the root 
searching algorithm will begin to backtrack in order to search for the time of impact. The result will 
be a never-ending search for a nonexistent root. Even if a root is determined within a threshold value, 
the subsequent attempt to advance the simulation is doomed to repeat a new root search for the same 
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nonexistent root. In other words, the simulation will either go into an infinite loop never advancing the 
simulation, or alternatively, the system will end up in a faulty state with penetrations greater than e. 

One possible strategy to avoid the above-mentioned problems would be to adapt a more advanced 
thresholding method, using one threshold for the root searching algorithm and another threshold value for 
determining the type of contact [Baraff, 1995], However, it is extremely difficult to get such an approach 
to work efficiently and robustly in practice. Another way out would be to penalize penetrations by a force- 
feedback term. However, what you get is all the troubles of penalty methods, and there is no guarantee 
that the error will be within a certain limit, since a penalty force like a spring does not impose a hard 
constraint. 

If your simulator is already based on a penalty method, you have nothing to worry about; however, 
retroactive advancement is a very bad choice for a penalty-based simulator; instead, fixed time-stepping 
is often the preferred choice. On the other hand, if your simulator is a constraint-based simulator, it seems 
daunting to add penalty forces to it, since it essentially will turn your simulator into a hybrid simulator. Not 
only can you get the advantages from both types of simulator, but also you can inherit all the sicknesses. 

In our opinion, a far better approach for resolving the problem is to displace objects when their pene¬ 
tration depth is greater than a certain tolerance correction value. For example, set the tolerance correction 
to 

<5 = \e, (7.211) 

whenever the penetration depth increases beyond 5, objects are displaced to reduce or remove penetrations. 
This resolution technique will guarantee that you never will get into problems with your main simulation 
loop. However, there are some drawbacks; a displacement could change the potential energy of an object. 
If the energy is increased, the subsequent simulation might transform the potential energy into kinetic 
energy, causing objects to begin jittering and jumping around. In our opinion, it is more pleasing to look 
at a simulation where potential energy vanishes. Our main point is that displacements should occur such 
that the potential energy is nonincreasing. 

7.10.2 Unavoidable Penetrations 

In some cases, penetrations are simply unavoidable due to the discrete time-stepping. This can be hard to 
grasp, so we will present a small example, originally presented in [llerhpj, 2004], illustrating how pene¬ 
trations can occur. The example uses the fixed semi-implicit time-stepping method given in Figure 7.15. 
In the initial state the rod is moving to the right and the box is fixed as seen in Figure 7.25. The time¬ 
stepping method first tries to guess the next position by doing a fake position update, t' = t + At. This 
fake position is first used to determine contact points shown as small solid circles, and then the velocities 
are updated at the fake position. As shown in the figure, the velocity update predicts a tipping movement 
of the rod at the fake position. Now the time-stepping method will use the velocities computed at the fake 
position to update the real position. However, the tipping movement will cause an unwanted penetration 
of the rod and the box. A rule of thumb is that penetrations can occur in the same order as the order of 
the time-stepping method used. To be specific, the fixed semi-implicit time-stepping method introduced 
in Section 7.10 is of first order, i.e., O(h), where h denotes the step-size. The errors accumulate linearly 
with the number of steps n, such that the error after n steps is proportional to nh. This might appeal - to 
be bad, however from a convergence theory point of view, lowering h will guarantee better accuracy, and 
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t = t + dt 


Figure 7.25: Penetrations can occur with fixed semi-implicit time-stepping. 
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in the limit h i—> 0 a perfect solution will be found. In practice this is not feasible, since all limiting cases 
cannot be reached on a computer with finite precision arithmetic; even if the limiting cases were possible 
it would take far too long to compute. Therefore, a better and practical approach is to pick a step size, 
such that the errors never grow large enough for an end-user to notice them. 

Another example of unavoidable penetrations is the configuration shown in Figure 7.19. Here the 
problem is that we have a linear discretization, but a higher order surface. The numerics cannot see the 
higher order and do not care about it. 


7.11 A Unified Object-Oriented Constraint Design 


From past chapters we learned that four types of constraints can be expressed by using Jacobians. These 
arc joint constraints, contacts, joint limits, and joint motors. For these types of constraints we derived the 
following kinematic constraints 


•Tjoint^ — bjoint > (7.212a) 

•^contactW A b CO ntact> (7.212b) 

'/ limit " ' blimit) (7.212c) 

J motorw A b motor . (7.212d) 


Adding the reaction forces to the equations of motion results in the generalized acceleration 

^ < t t \ - - i tT \ i tT \_. . i tT \ i -e \ 


vector 


( jT \ I tT 

1 J ioint^ioint T 


eontQet 'Apnntart 


-I- .77 A, 


with the following limits on the Lagrange multipliers 


-OC ^Ajoint — i 

(7.214a) 

0 ^A CO ntact ^ 

(7.214b) 

0 ^Afimit ^ OO, 

(7.214c) 

'max ^A mo tor ^ A max . 

(7.214d) 


Performing the usual discretization steps and substitutions, we derive the following complementarity for¬ 
mulation 

(u + AfiW -i /ext) - bj( 


An 

^4-21 


-4.12 

4-22 



Ajoint 


J joint ( 

y / 

- 

Acontact 


J contact \ 


Alimit 

+ 

J limit ( 


Amotor 

Aaux 


J motor ( 


' ext) 

(u + A/M _1 /ext) 


’ ext 


^joint 

bcontact 


■■'motor 


Uaux 


> o, 


(7.215) 


where 


T AT -1 T t 

joint'*' •'joint 

T ■ AT” 1 T t 
" joint'*' " contact 

T AT” 1 T t 

" joint'*' •'limit 

T ■ AT” 1 T t 1 

° joint'*' •> motor 

T 1\ZT~ 1 T t 

’ contact'*' ^°int 

T A/f -1 T t 

° contact'*' contact 

T A/I~ l T t 

J contact'*' •'limit 

T AT” 1 T t 

•' contact'*' »' motor 

J limit 4T J joint 

•Tlimit -AT '^contact 

J limit-AT J ii m it 

•T limit-AT J motor 

T AT” 1 T t 

u motor 1V± u j oint 

T AT” 1 T t 

° motor'*' ** contact 

T AT” 1 T t 

U motor 1VJ u limit 

T AT” 1 T t 

J motor'*' *» motor J 


(7.216) 
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and A 12 , A 22 , A 21 , and 6 aux corresponds to a permutation of the third row and column in (7.60), that is 
the auxiliary constraints needed to model the frictional force. This is complementary to 


—00 


00 

0 


00 

0 

< A < 

00 

A max 


•^max 

0 


OO 



(7.217) 


where complementarity is understood coordinate-wise and is best explained by introducing the vector w 


w = AA + b. 


(7.218) 


The i’th coordinate of w is paired with the f th coordinate of A. Complementarity means that for all i, 
one of the three conditions below holds 


A* = A[ 0Wi , Wj > 0, (7.219a) 

A, = A h i g h t , w t < 0. (7.219b) 

Alowi T A i < Ahighjj^i — 0. (7.219c) 

Figure 7.26 helps illustrate the complementarity conditions in equation (7.219). Notice that as long as A, 
is within its lower and upper bounds, w, is forced to zero. The vector w, is only nonzero at the lower and 

upper bounds. Usually, the LCP is formulated with Ai ow = 0 and Ahi g h = 00 , in which case the above 

definition reduces to the familiar - notation 


A > 0 compl. to w > 0. 


(7.220) 


We refer the reader to [Cottle et al., 1992, Murty, 1988] for more details on LCP or Section 19.10. 
To ease notation further, we may concatenate all the Jacobians into a single matrix. 


J = 


J joint 
J contact 
J limit 
J motor 


(7.221) 


Similarly we may concatenate all the error correcting terms, 


terror — 


^joint 

^contact 

&limit 

^motor 


(7.222) 
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high. 


Figure 7.26: The complementarity condition on the i’th variable. 


and we may thus write 


-' 4-21 


- 4-12 

422 


A + 


(■u + A tM 

b. 


^ f ext) - b e 


> 0 


(7.223) 


complementary to 

Aiow < A < Ahigh- (7.224) 

Having rewritten the complementarity formulation into this form allows us to compute the system matrix 
very efficiently. Notice also how easy we can write the generalized acceleration vector with the new 
notation, 

u = M~ 1 (j T X + f ext ). (7.225) 

When assembling the system matrix, we must first allocate space for all matrices involved, and then set 
up the subblock structure for each individual constraint. We evaluate each constraint, i.e., compute joint 
axes, joint positions, joint errors, and so on. That is, all information needed in the Jacobian matrix and the 
corresponding error term for each constraint, including any auxiliary constraints are evaluated. 

Care must be taken in the evaluation since joint limits and motors depend on the joints they are attached 
to, and joint motors and limits must therefore not be evaluated before the joints they are attached to. 

After having performed the evaluation, we can determine which constraints are currently active and 
which are nonactive. The nonactive constraints can simply be dropped from further consideration in the 
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cuiTent assembly. As an example, the joint limit is nonactive whenever the joint has not reached its outer 
limit. 

Knowing how many active constraints there arc, we can compute the dimensions of the matrices 
involved. For each constraint we will query how many rows its Jacobian matrix contains and we will 
query how many auxiliary variables there arc. Summing these up we arc able to determine the total 
dimensions of the matrices and vectors needed in the assembly. 

During these summations it is also possible to assign indices for the subblocks for each and every 
constraint, where its Jacobian matrix, error term, and the auxiliary variable data should be mapped to. 
Figure 7.27 shows the pseudocode. Observe that substantial memory savings can be achieved by using a 
simple sparse matrix representation of the matrices M '' 1 and J, as shown in the pseudocode as well. We 
can now start tilling in data in the matrices M 1 and J, and the vector 6 error , together with the parts of 
the system matrix A and the right-hand side b containing the auxiliary data. Also, external forces, limits, 
and the generalized velocity vector arc easily dealt with as shown in Figure 7.28. Using these matrices it 
is quite easy to compute (7.223) in a straightforward manner. The only real difficulty is that one must be 
careful about the sparse representation of the matrices M 1 and J 1 . 

We have completed what we set out to do in this section. A unified framework for handling both 
contact point constraints with friction and joint constraints with both motors and limits have been derived. 

The framework allows a modularized and object-oriented design of all the constraints in a simulator. 
This object oriented design is outlined in Figure 7.29. In the figure we have omitted details regarding 
assembly of the jointed mechanism (linking bodies together with joints, and adding limits and motors). 

7.12 Modified Position Update 

When the new generalized position vector is computed as 

a t+At = s f + AtSu t+At , (7.226) 

then we call it a position update. In the position update written above, an infinitesimal orientation update 
is used. This is fast to compute, but can occasionally cause inaccuracies for bodies that are rotating at high 
speed; especially when joined to other bodies. 

For instance, in a car simulation, four wheels might be attached to a chassis with a wheel joint. When 
driving the car the wheels may rotate in incorrect directions, as though the joints were somehow becoming 
ineffective. The problem is observed when the car is moving fast and turning. The wheels appeal - to rotate 
out of their proper constraints as though the wheel axes have been bent. If the wheels are rotating slowly, 
or the turn is made slowly, the problem is less apparent. The problem is that the high rotation speed of the 
wheels is causing numerical errors. A finite orientation update can be used to reduce such inaccuracies. 
This is more costly to compute, but will be more accurate for high-speed rotations. High-speed rotations 
can result in many types of errors in a simulation, and a finite orientation update will only fix one of those 
sources of error. 

We will now outline a modified position update like the one used in the Open Dynamics Engine 
[ODE, 2005]. For each body B{ we want to be able at configuration design time to set a kind of bit flag 
indicating if the body should be updated using the infinite or finite update method. In case of the finite 
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Algorithm allocate (...) 

C = set of all constraints 

l Ijacobian — 0 

for each constraint c £ C do 
c.evaluate() 
if not c.active() then 
remove c from C 
end if 

c. set Jacobianlndex (rijacobian) 

Mjacobian+ = c . getNumberOf JacobianRows () 
next c 


^auxiliary — 0 

for each constaint c £ C do 

c. set Auxiliary Index (rijacobian + n aU x iiiary) 
Wauxiiiary+ = c • getNumberOf AuxiliaryVars ( ) 
next c 

J = mafrim/rijacobian, 12) 

terror = VCCtOT (rijacobian) 
ntotal = njacohian “h ^auxiliary 

A = matrix{nxoiai, n to tai) 
b = vector (ntotai) 

A = vector(ntotai) 

Aiow = vector(ntotai) 

Ahigh = vector(n to ai) 

B = set of all bodies 
ribody = B . size () 

Mmv = matrix(3,6nbody) 
u = vector (6ribody) 

/ext = vector (6n b0 dy) 

End algorithm 


Figure 7.27: Allocate system matrix and setup subblock structure for constraints. 
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Algorithm fillln(...) 

C = set of all active constraints 
for each constaint c £ C do 
i = c.getJacobianlndex() 
j = i + c.getNumberOfJacobianRows() 

J \in = J(i..j),( 0..2) 

Jlin — 5) 

Jang = 6..8) 

Jang = J(i..j),( 9..11) 
c. getLinear Jacobian_i (Jf^) 
c. getLinear Jacobian_j ( J J lin ) 
c. get Angular Jacobian_i (Ja ng ) 
c. getAngular Jacobian_j (Jl ng ) 
c . getErrorTerm (b er ror,(i..j)) 
c. get JacobianLowLimit (Ai ow , ) 
c. get JacobianHighLimit (Ahi g h,(i. .j)) 
next c 

for each constaint c £ C do 
i = c.getAuxiliarylndex() 
j = i + c.getNumberOfAuxiliaryVars() 

Arows = -4-(*..j),( 0 ..n toB1 _ 1 ) 

Acois = A( 0 .. ntotal _ 

c . getAuxiliaryRowsAndColumns (A r0W s, A co is) 
c . getAuxiliaryLowLimit ( ^iow ,) 
c . getAuxiliaryHighLimit (Ahi g h,(i..j)) 
c . getAuxiliaryRightHandSide (b(i..j)) 
next c 

B = set of all bodies 
for each body b £ B do 
i = 6 b.getlndex() 
j = i + 6 

b. getMassInvMatrix (M' inVj (j.. i+3j o.. 2 )) 
b. getlnertialnvMatrix (M inVi ( i+3 .. i+5i 3 .. 5 )) 
b. getLinearVelocity (u i..i+ 2 ) 
b. getAngularVelocity (u ;+ 3 ..i +5 ) 
b . getExternalForce ( /ex,, (*..*+2) > 
b. get External Torque (/ ex t,(i +3 ..i +5 )) 
b.getlnertiaMatrix (I) 
b.getAngularVelocity(w) 

/ext,(i+3..i+5) — = W X III! 
next b 

End algorithm 


Figure 7.28: Fill-in data in subblock structure for constraints and bodies. 
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Figure 7.29: The Constraints Design. Observe the unification of contacts and joints. 


update we want to compute a rotation, q, corresponding to the radians traveled around the rotation axis in 
the time interval A t, that is, the rotation angle 9 is given by 


6 = At ||u>i|| , 


where the rotation axis is 



(7.227) 


(7.228) 


The corresponding quaternion q is then given as 
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This can be rewritten as follows: 



cos 


At II ojj. 


At 


-ursine 


At || ur 


Introducing the notation h = At/ 2, and 6 = ||wi|| 2 h, we end up with 

q = [cos (6 ), cjjsinc ( 9 ) h] , 


(7.230a) 

(7.230b) 

(7.230c) 

(7.230d) 

(7.230e) 

(7.231) 


where 


sine (x) ~ 



if |x| < e, 

otherwise. 


(7.232) 


In order to avoid division by zero we have patched the sine function around zero by using a Taylor expan¬ 
sion for small values e = 10 -4 . Furthermore, we want to be able to do both a full finite orientation update 
and a partial finite orientation update. We have already taken care of the full finite orientation update, and 
for the partial finite orientation update we split the angular velocity vector into a component u> n n j| C along 
a specified finite rotation axis r ax j s , and a component to m [j n j te orthogonal to it, that is, 


^finite — (Taxis ' ^i) ^axis> (7.233a) 

^infinite = ~ ^finite- (7.233b) 


First, a finite update is done with tonniie followed by an infinite update done with <x> m n n jt e 

Qi = qqi, (7.234a) 

qi = qi T AtQjii) infinite- (7.234b) 

A partial finite orientation update can be useful in a situation like the previously mentioned wheel problem; 
we simply set the finite rotation axis equal to the hinge axis. Figure 7.30 shows the pseudocode for the 
position update on body Bi. 
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Algorithm position-update (i, At) 
ri = ri + A tvi 

if finite rotation onithen 

if finite rotation axis on«then 

^finite = (4 1 axis ' CV i ) r axis 
^infinite — OJi finite 

At = At/2 

0 — (T*axis ' Wi) At 
q s = cos (0) 

q v = (sinc(0)At) WMie 
else 

At = At/2 
0 = ||wi|| 2 * At 
q s = cos(0) 
q v = (sinc(0)At)u>i 
end if 
Q = [qs,qv] 

qi = qqi 

if finite rotation axis on«then 

qi = qi A I Q, CUinfinite 

end if 
else 

qi = qi + At.QiUi 
end if 

normalize (qi) 

End Algorithm 


Figure 7.30: Position update on i’th body. 


7.13 Constraint Force Mixing 

The constraint equation for a joint has the form 


Ju = b, (7.235) 

where u is a velocity vector for the bodies involved, J is the Jacobian matrix with one row for every 
degree of freedom the joint removes from the system, and b is the error correcting term. 

The constraint forces, that is, the reaction forces from the joint bearings are computed by 

F = J T X, (7.236) 

where A is a vector of Lagrange multipliers and has the same dimension as b. The Open Dynamics Engine 
[ODE, 2005] adds a new twist to these equations by reformulating the constraint equation as 

Ju = b — K cmi A, (7.237) 
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Where K cm f is a square diagonal matrix. The matrix K c m f mixes the resulting constraint force with the 
constraint. A nonzero (positive) value of a diagonal entry of K cm f allows the original constraint equation 
to be violated by an amount proportional to K cm f times the restoring force A. Solving for A gives 


Mu = M- 


u A t+t _ u 


At 


= J 1 A, 


(7.238) 


from which we isolate u At+t as 


u At+t = u + AtM- 1 J T \. (7.239) 

Assuming that the constraint equation holds at time t + At and substituting the expression for u At+t into 
the constraint equation, we get 


Ju At+t = b- K cmf A, (7.240a) 

Ju + AtJM~ l J T A = b- K cm{ A. (7.240b) 

Collecting A terms on the left side we get 

(^JM- 1 J T + ilf cmf jA=i(6-J U ) ) (7.241) 

from which it is clear that the K cm f matrix is added to the diagonal of the original system matrix. 

According to [ODE, 2005], using only positive values in the diagonal of K cm f has the benefit of 
moving the system away from singularities and improves factorization accuracy. 

This twist of the constraint equations is known as constraint force mixing, and the diagonal entries of 
the matrix K cm f controls the amount of mixing that is done. 

7.14 First-Order World 

First-order world simulation is greatly inspired by the way Aristotle saw and described the world at his 
time. These misconceptions were later rectified by Newton in his three laws of motion. Regardless, a 
first-order world is useful in animation for error correction and precise positioning of objects. Aristotle’s 
basic views were: 

1. Heavier objects fall faster. 

2. To keep an object in motion at constant velocity, a constant force is needed. 

The problem with the second statement was in not realizing that in addition to the pulling or pushing 
force, there arc other forces involved, typically friction and air or water resistance. Fortunately, these 
misconceptions arc of great practical use in computer animation and also in more serious applications 
such as virtual prototyping for example, [Redon et ah, 2003]. 
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♦ * * ♦ 


Figure 7.31: A sequence in a first-order world simulation share two identical objects that arc aligned. The 
left box is pulled to the right. Observe that both the left and right boxes arc equally affected. 


0 “ ^ 


Figure 7.32: A sequence in a first-order world simulation where two objects of different mass arc aligned. 
The left box has less mass than the right box. The left box is pulled to the right. Observe that the heavier 
box on the right is less affected than light box on the left in comparison with Figure 7.31. 

A first-order world is useful due to its controllability. In Newtonian mechanics, objects keep moving or 
sliding after user interaction, making it hard to accurately position objects interactively in a virtual world. 
On the other hand, in a first-order world, an object would stop immediately after a user stops pushing it. 

Ease of interaction can be obtained by merely translating objects, but such an approach does not take 
care of the rotational movement induced by collisions, etc. In a first-order world, misalignments between 
objects arc handled through simulation. 

For instance, if a light misaligned object is pushed against a heavy perfectly aligned object, then the 
simulation will automatically align the two objects w.r.t. each other. Furthermore, the light object will 
be aligned more than the heavier object. Figures 7.31 and 7.32 show a few frames of a first-order world 
simulation where two objects arc automatically aligned. As seen in Figure 7.31 and 7.32 a first-order 
world simulation is well suited for aligning objects and it respects their mass and inertia. This smooth 
alignment is very attractive in many virtual environments; this also makes a first-order world simulation 
an ideal tool for correcting simulation errors [Baraff, 1995]. An example of this is shown in Figure 7.33. 

In a second-order system we have a state function, s(t), for a rigid body 
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Figure 7.33: First-order world simulation used to correct penetration error. Left shows initial state, right 
shows the corrected state. Observe that when corrected, the upper box is both translated and rotated. 


where 


or equivalently, we have 


v 


—, and Ld = I l L, 

m 


r 


V 

q 

, and s(t) = 

\^q 

F 

V 

m 



I (t — LL> X L) 


(7.243) 


(7.244) 


r 

, and s(t) = 

F 

l r-i 

_q_ 


vh 1 T q\ 


In a first-order system, the state function for a rigid body simplifies to 

s(t) = 

Observe that the difference is that force relates directly to velocity, that is 

F = rnv. 

T = ILO. 


(7.245) 


(7.246) 

(7.247) 


This is the first-order world equivalent to the Newton-Euler equations of motion. Thus, in a first-order 
world there is velocity but no acceleration, and first-order worlds arc therefore very useful for evaluating 
system kinematics. In a first-order system, the dynamics equation F = rnv dictates that objects have no 
intrinsic velocity of their own; equivalently, the velocity at any given instant depends completely on the 
forces acting at that instant. This means that velocity-based damping and friction arc nonexistent in a first- 
order world, and that inertial forces due to velocity arc absent. As we will see later, these consequences 
greatly simplify the contact modeling compared to a second-order world obeying the Newton-Euler equa¬ 
tions of motion. 
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7.14.1 Single Point of Contact 

Now we'll study a single point of contact, p k , between two bodies i and j. Let r z and rj be the center of 
mass position of the bodies, then the two vectors from the center of mass to the point of contact arc found 
as 


r ki = Pk ~ r i J (7.248) 

r kj = Pk ~ r j- (7.249) 

The change in velocity of the contact point with respect to each body arc 

A Ui = A Vi + A Ui x r ki , (7.250) 

A Uj = A Vj + A ujj x r k j. (7.251) 


In a first-order world, a change in velocity corresponds to a change in force. For the time being we will 
ignore all other forces in the system except for the forces acting at the contact point. This means that the 
change in force is simply the force acting at the contact point, and for body j we have 

Avj = —, (7.252) 

rrij 

A wj = Ij 1 (r kj x F ), (7.253) 

where F is the force acting on body j. Assuming the law of reaction equals action, we must have that the 
force on body i is given by —F, yielding the following equations for body i 

A Vi = (7.254) 

mi 

Ac Ci = -II 1 ( r ki x F ). (7.255) 

Now let us compute the relative change in contact velocity as 

u = (Auj — A ui). (7.256) 


Substituting previous results yield 

u = ((A vj + A uj x r kj ) - (Avi + Aui x r ki )), 

U = ((^ + ( /jTl (rfci X F) ) X Tkj ) ~{~m + (rfci x F) ) x Vki )) 


(7.257a) 

(7.257b) 
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Using the cross matrix notation for the cross products we can isolate F as 

U = ((^ + rkj>< ) " + ( rfci><F )) rfci>< )) ( 7-258a ) 
= (Jr + F ~ ( rfc j x/ 7 lrfc i x ) F - (rM x ir lr i « x ) F ( 7 - 258b ) 

= (^- + ^-) F - (rfc/JTVfc/ + rfci x Jr 1 r fci x ) F (7.258c) 

= ((^ + ^) 1 - (rkj y IJ ] r kj y + r fci x Jr 1 r fci x )') F (7.258d) 

----- - -'' 

K 

u = KF. (7.25 8e) 

The matrix IT is the familial - collision matrix [Mirtich, 1996], and we have derived the same relationship 
between force and contact point velocity in a first-order world as we have done earlier for collision im¬ 
pulses and contact point velocities in a second-order world. Measuring the relative contact velocity in the 
normal direction only n, yields 

u n = n T u = n T ( KF), (7.259) 

and restricting the contact force F, to be parallel to the normal direction is simply 

u n = (n 1 Kn ) /, (7.260) 

where / is a scalar denoting the magnitude of the force F, and u n is likewise a scalar denoting the 
magnitude of relative contact velocity in the normal direction. 


7.14.1.1 Penetration Correction 


The previously derived relation (7.260) provides a convenient way for projecting penetrating bodies out 
of each other. The idea is as follows: set up (7.260), such that you solve for the force F, which will yield 
a change in the relative contact velocity u , that will correct the penetration during the next time-step h. 

Let the penetration depth be given by the distance d, then the correcting velocity at the contact points 
must be 

U n = y, (7.261) 

h 

and solving for the correcting force yields 


d 

f = —-—. 

n T Kn 


(7.262) 


Now use F = nf in the state function (7.245) to perform a single forward Euler step. This yields a 
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position update, which will correct the penetration between the two objects, that is, 

r l + l =r t i + h^-, (7.263) 

rrii 

7* +1 = 4 + h\ (I” 1 ( r ki x (-F))) qt, (7.264) 

rf 1 =r t j + h—, (7.265) 

J J rrij 

q* +1 = 4j + h l - (ij 1 (r kj x F)j q). (7.266) 

In the above we have applied a forward Euler scheme, meaning that we have linearized the sought dis¬ 
placements. The correction is therefore only correct to within first-order, but in practice this is seldom a 
problem, since most simulation paradigms try to prevent penetrations, indicating that penetration errors 
arc likely to be small and only require small corrections. 


7.14.1.2 Continuous Motion 

The constraint forces needed in a first-order world simulation can also be derived from (7.258e). Taking 
external forces into account, say forces F l ext and F J ext and torques r* xt and T J ext working on bodies i and j 
respectively, then it is seen by substituting the forces into (7.257a) that (7.258e) changes into the form 

u = KF + b, (7.267) 


where b is given by. 


b = 


F - 1 

± ext 
rrij 


^ ext 

rrij 


I ■ 

' j ex 


X r kj - li 1 rl xt X r kl . 


(7.268) 


Here F denotes the constraint force at the contact point, which must prevent penetration from occurring. 
This implies that the relative contact velocity in the normal direction must be nonnegative, since a first- 
order world is frictionless. According to the principle of virtual work it makes sense to require that the 
constraint force is parallel to the normal direction; also the bodies are allowed to separate, which means 
that the constraint force can only be repulsive. To summarize we have 


u n > 0, and / > 0. 


(7.269) 


Furthermore, u n and / must be complementarity, meaning that if u n initially is greater than zero, then 
there is no need of a constraint force, that is, 


% > 0 / = 0. (7.270) 

On the other hand, if we have constraint force acting at the contact point, then the normal velocity must 
be zero, that is, 

/ > 0 => u n = 0 . 

In conclusion, we discover a linear complementarity condition similar - to (7.21). 


(7.271) 
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7.14.2 Multiple Contact Points 

In this section we will extend the machinery for a two body problem to handle an n-body system. 
Let u k denote the change in relative contact velocity and at the k’th contact, then 


As previously, we also have 


Ufa — Auj k Aui k . 

A u ik = Av ik +Au> ik x r kik , 
Au jk = Av jk + Au jk x r kjk , 


(7.272) 

(7.273) 

(7.274) 


however this time we do not have a single contact force contributing to the fc’th contact. To get the total 
force and torque contributing to the fc’th contact we have to sum over all forces and torques. Using the 
indexing introduced in Section 7.1 we write 


A v jk = 


-Cext + Ylh,j h =j k Fh Ylh,i h =i k Eh 


A^ifc = IJ h L 


Jfc I 
1 ext ' 


r hj h x Fh I I f'hib X Eh 

h,jh=jk ) \h,ih=jk 


(7.275) 


(7.276) 


where F k is the contact force at the K h contact point. If we perform the same substitutions as done in the 
case of a single point of contact (7.267), then the contributions from the external forces and torques can 
be collected in a single vector b k . The contribution stemming from the contact forces can then be written 
as a lineal - combination. 


where 


Ki,h = 


Uk = K kh F h j + b k , 

+ ^~) 1 - ( r k Jk x Ij l ' r hjk x + r klk y I- ] r hik y ^ if i h = i k and j h = j k , 

1 + ( r ki k x I~ k r hih x ) if i h = i k and j h ± j k , 

> \ 

) 1 + \ Tkj k x I jk r hi h x ) if ih = jk and j h ± i k , 

) 1 - ( r ki k xx ) if j h = ik and i h + j k , 

4r) 1 - ( r kj k x !j k rhj h x ) if jh = jk and i h ± i k , 


(7.277) 


(7.278) 


[ 0 otherwise. 

If we only consider relative contact velocity in the normal direction, and only allow contact forces to be 
parallel with the contact normals, then it is easily seen that the equation transforms into 


(7.279) 


Unk =n k u k = n T k K kh n h f h + b k , 
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where // l denotes the magnitude of the /Tth contact force. The equations for all the relative normal contact 
velocities can now be written as in a single matrix equation. 


UnO 

— 

n^K 00 n 0 

no KoK-inn-i 


' fo ' 

+ 

1- 

o 

-O 

ho * * * 

s 

1_ 

UnK- 1_ 

s. 

nj^_ i K x— 10^0 • 

■ ■ n^^K K-iK-\nK-\_ 


Jk-\_ 

y ''Si 

n] < _ l b K -i\ 

a 



A 


X 


b 


=> a = Af + b. (7.281) 


This equation is a si mi lar equation to (7.25), and recalling the definition of the contact Jacobian in Sec¬ 
tion 7.7.7 and the assembly process in Section 7.11, we immediately generalize. 


A = 
b = 


J contact 
J contact 


M~ l 


J 


T 

contact ? 


M~ 7 ext 


(7.282) 

(7.283) 


Note that in a first-order world / ext does not contain inertial forces. An LCP can now be formulated for 
the continuous movement of a first-order world as 


a > 0, f > 0, and f 1 a = 0. 


(7.284) 


A penetration correcting force can be computed by redefining the b- vector 


b 


1 

h 


—do 

—dx-i 


(7.285) 


and then solving the same LCP as we formulated above. For the case of a single-point contact, we could 
solve the linear system. But for multiple contact points, the projection of a penetration of one contact point 
might fix the penetration of another, and this induces a complementarity condition where there can be no 
correcting force if there is no penetration. On the other hand, if there is a correcting force, the penetration 
depth must be zero. 


7.15 Previous Work 

Impulse-based simulation was introduced by Hahn [Hahn, 1988], where time integrals of contact forces 
were used to model the interactions. In recent times, Mirtich [Mirtich, 1996] worked with impulse-based 
simulation using a slightly different approach, where contact forces were modeled as collision impulse 
trains. Hahn’s and Mirtich’s work represent two different ways of thinking of impulse-based simulation: 
as a time integral and as a sum of delta functions approximating the corresponding time integral. Mirtich’s 
work led to a new simulation paradigm as discussed in Chapter 6. Hahn's work could be interpreted as an 
early predecessor to the velocity-based formulation. 
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Stewart and Trinkle [Stewart et al., 1996, Stewart et al., 2000, Stewart, 2000] made a new impulse- 
based method. Their method is now known as Stewart's method. Like Hahn’s method, Stewart’s method 
also computes the time integrals of the contact forces, and it has since inspired many: Anitescu and Potra 
[Anitescu et al., 1997] extended the method to guarantee existence of the solution; Sauer and Schomer 
[Sauer et al., 1998] extended with a linearized contact condition; Song et al. [Song et al., 2003] used a 
semi-implicit time-stepping model for frictional compliant contact; and most recently, an implicit time¬ 
stepping method for stiff multibody dynamics by Anitescu and Potra [Anitescu et al., 2002] and Hart and 
Anitescu introduced a constraint stabilization [ Hart et al., 2003] method. 

Even though Stew art’s method is an impulse-based method, it looks and feels like a typical constraint- 
based method, such as the one formulated by Baraff [Baraff, 1994, Baraff, 2001], Trinkle, Pang, Sudarsky, 
and Lo [Trinkle et al., 1995], Trinkle, Tzitzoutis, and Pang [Trinkle et al., 2001], and Pfeiffer and docker 
[Pfeiffer et al., 1996a]. These are termed force-based formulations and Stewart’s is termed an impulse- 
based or velocity-based formulation. 

Stew art and Trinkle originally used position constraints in their formulation; these position constraints 
ensure nonpenetration, but suffer from existence problems unless all contact normals arc linearly indepen¬ 
dent. 

Anitescu and Potra’s velocity-based formulation always guarantees a solution, but there is one draw¬ 
back: penetrations might occur when the system moves along concave boundaries of the admissible region 
in configuration space. Another side effect from the velocity-based formulation is that special attention 
must be added to separating contacts moving tow ard each other. If these arc present, they will be treated 
as if they were in colliding contact; this could leave objects hanging in the air. Anitescu and Potra pro¬ 
pose a time-stepping algorithm that correctly handles this side effect, but other authors tend to forget the 
problem. 

Stewart and Trinkle suggest that simple projection can be used to eliminate this problem, but care 
must be taken to avoid losing energy in the process. However, they do not mention how one should do the 
projection. 

Stewart and Trinkle [Stewart et al., 2000] suggest using contact tracking (it could be coined “retroac¬ 
tive detection” as well) to avoid the problem of explicitly determining all potential active contact con¬ 
straints. That is, once a solution is found, the constraints at time t + At arc checked. If any one is found 
to be violated it is added to the set of active contact constraints and the problem is resolved once more. 
This of course, increases the amount of computations; often this is undesirable. For instance, in the Open 
Dynamics Engine, a fixed time-stepping advancement is chosen and any errors occurring arc reduced by 
using penalty forces. The strategy is: don’t tty to prevent errors, but fix or reduce them if they occur. 
Another issue (which is not obvious, at least to us) is: is it possible to use a retroactive detection of con¬ 
tact constraints when we know there is a problem with concave boundaries of the admissible region of 
configuration space? 

Sauer and Schomer [Sauer et al., 1998] use a linearized contact condition; they say this is for potential 
contact constraints and they require the assumption that during their fixpoint-iteration contact constraints 
do not change topologically, i.e., (E, E) i—> ( V ., E). It is not obvious to us why this is required. 

In the summary section of Stewart and Trinkle’s paper [Stewart et al., 2000], there are some thoughts 
on how their method can be extended to handle a nonzero coefficient of restitution. They suggest stopping 
at collision times, which in our opinion is not an attractive choice. Actually their thoughts are based on the 
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work by Anitescu and Potra. In the Open Dynamics Engine a slightly different approach is used similar 
to [Baraff, 1989]. 
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Deformable objects cover all object types that arc not rigid, and thus span a wide range of different 
object types with different physical properties, such as jelly, cloth, biological tissue, water, wind, and tire. 

Deformable objects arc considered more difficult to deal with than rigid body simulation for several 
reasons. First, is the complexity of variables needed to describe the motion of a single object exploded 
in magnitude compared to a single rigid object. For a rigid object, only the motion and orientation of 
its center of mass arc needed; for a deformable object every little particle needs to be described. Sec¬ 
ond, since deformable objects could potentially self-intersect, both the geometric problem of detecting 
self-intersections and the collision resolving arc unpleasant problems to deal with. Third, the physics of 
deformable objects is described by a constitutive law describing relationships between physical properties 
inside the object, such as stress and strain. Although a constitute law often can be recognized as New¬ 
ton’s second law of motion, a student often needs to learn a multitude of new mathematical notations, 
terminology, and physical properties, such as calculus of variation, stress, strain, etc., in order to master 
its application to deformable objects. 

The dynamics of deformable objects have been well known by physicists for the last 200 years or 
so. The mathematical models arc well developed and the engineering community has been simulating 
computer models of deformable objects almost since the beginning of computers. Compared with this 
long history, it is not until recently that physically based animation of deformable objects has been in¬ 
troduced to the computer graphics community by the pioneering work of Terzopoulos el al. in 1987 
[Terzopoulos et al., 1987]. 

Computer animation differs from traditional engineering applications in several aspects. First, the 
computational resources arc low and the demand for fast performance and high visual details arc weighted 
much higher than accurate computation. Second, computer animation is more concerned with showing 
interesting motion such as object deformation or fracture, than with the end results. Computational grids 
in computer animation arc extremely coarse compared to the grids used in engineering, and stability and 
robustness is strongly favored over accuracy and precision. Consequently, computer animation is often 
based on implicit time-stepping methods with large time-steps, rather than explicit methods with small 
time-steps as preferred in engineering disciplines. Finally, computer animation is about making tools such 
as Maya® or 3DMax® for artistic and creative people, which have little respect for the real world. That 
is, a material may be given unrealistic properties, which produce an entertaining motion on a computer 
screen, but which do not correspond to any existing materials. Furthermore, objects may be forced into 
unnatural poses, simply because the poses are funny. The implication is that although physics-based 
animation uses physics as its foundation, physics-based animation must be able to cope with unphysical 
situations robustly. 

In the beginning, people used rather simple constitute models in computer animation. Simple lin¬ 
eal - models between stress and strain of isotropic materials were applied. The quest for higher realism 
has moved the computer graphics community toward more advanced constitute models, since in anima¬ 
tion large deformations are sought, and materials found in nature exhibit complex nonlinearities under 
large deformations. Recently, there has been an increased focus in the computer graphics community 
on improved handling of large deformations. The constitutive laws are often solved on mesh grids, but 
simulating large deformations can severely damage the integrity of a mesh grid. For instance, mesh ele¬ 
ments can be inverted or entangled. There have been attempts to remodel the underlying physical models 
by introducing force-terms that counter these effects, or strategies for dynamically remeshing or simply 
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Idealization Discretization Solution 



Figure II. 1: Analysis process in computational mechanics. 


deleting erogenous mesh elements. However, many of these solutions exhibit visual artifacts, such as 
mesh elements that pop out during simulation. 

Deformable objects in computer animation have found their way to the movie theater: movies such 
as Monsters Inc.® use implicit constraint-based cloth animation [ Bara IT et al., 1998] with untangling of 
mesh elements [Baraff et al., 2003b], Phantom Menace® uses a guaranteed non self-intersecting time¬ 
stepping method for cloth animation [Bridson et al., 2002], and The Rise of the Machines® used a fold 
preserving and increasing method [Bridson et al., 2003]. In computer games methods based on over- 
relaxation have become widely popular [Jakobsen, 2001]. 

The scope on the theory of deformable objects in this book is to establish the basic foundation. Our 
main intention is to focus on describing the constitutive models and their implementation. We refrain from 
the problems of mesh inversion and entangling and refer the interested reader to [Irving et al., 2004] and 
| Baraff et al., 2003b]. 

The field of mechanics can be divided into four different subparts: theoretical, applied, computa¬ 
tional, and experimental. In theoretical mechanics, the fundamental laws and principles of mechanics 
are studied. In applied mechanics, the theory is applied to science and engineering applications by con¬ 
structing mathematical models of physical phenomenon. In computational mechanics, actual problems 
arc solved by combining the mathematical models with numerical methods, leading to an implementation 
on a computer that is then simulated. In experimental mechanics, observations of the physical laws, the 
mathematical models, and the numerical simulations arc compared. In computer animation, the held of 
computational mechanics is used together with a plethora of simplifying assumptions that lead to faster 
simulations. 

The held of computational mechanics is highly interdisciplinary It uses applied mathematics and nu¬ 
merical analysis, computer science, and theoretical and applied mechanics to arrive at a discrete model 
that can be solved on a computer. The analysis process that leads to a discrete model is illustrated in Fig¬ 
ure II.l. Going from a physical system to a mathematical model is a process known as idealization. This 
stems from the fact that the real world is seldom ideal, especially when trying to express a phenomenon 
as a mathematical equation. The problem is expressed in an ideal situation that allows it to be formulated 
mathematically. Discretization is the process of reducing the continuum of the mathematical model to a 
hnite algebraic system. In Figure II.l three subpaths are shown at the discretization: Strong form ( SF ), 
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\ 


Weighted Residual 


Raleigh-Ritz 

4 


FEM 

4 

Galerkin 
Least Squares 
Subdomain 


Figure II.2: Discretization of strong form, variational form, and weak form. 


Weak form ( WF ), and Variational form (VF). Strong form is a system of ordinary or parti al differential 
equations in space and/or time together with a set of boundary conditions. It is called strong because the 
system is satisfied at each point of the problem domain. Weak form is a weighted integral equation that 
relaxes the strong form in a domain-averaging sense. Variational form is a functional whose stationary 
conditions generate the weak and the strong forms. Variational calculus is used to pass from one form to 
another: strong to weak form, or the other way around, variational to weak form, or variational to strong 
form. Going from variational to strong form is essentially what is taught in variational calculus, whereas 
the inverse problem of going from strong to variational form is not always possible. However, sometimes 
it can be done, as will be shown for a simple example below. 

Problems stated in strong form arc a well-established pari of calculus and mathematical physics. As 
an example, Newton’s second law F = ma is a strong form equation. The reason that the variational and 
weak forms arc used instead of the strong form is the following: a variational form functionally expresses 
all properties of a model: field equations, natural boundary conditions, and conservation laws. The func¬ 
tionals arc scalar's and therefore invariant to coordinate transformations, so variational form transforms 
between different coordinate systems. Both variational form and weak form arc used in discrete meth¬ 
ods of approximation on computers. Variational form and weak form characterize quantities like mass, 
momentum, and energy. Variational form and weak form lead naturally to conservation laws. Varia¬ 
tional form and weak form systematize the treatment of boundary and interface conditions with discrete 
schemes. 

The continuum mathematical models in strong form, variational form, or weak form initially have an 
infinite degree of freedom. The discretization process reduces this to a finite set of algebraic equations 
that can be solved in reasonable time. The discretization process for the different forms leads to different 
approximation methods as characterized in Figure II.2. The strong form is usually discretized using the 
Finite Difference Method (FDM ) where the derivatives arc replaced with differences. This is an easy 
process for regular - domains and boundary conditions, but becomes complicated for arbitrary geometry of 
boundary conditions. The weak form is discretized using the Weighted Residual Method. This consists 
of a lot of subclasses with names like Galerkin, Petrov-Galerkin, Subdomain, Finite Volume Method, and 
Method of Least Squares. All these subclasses arc called trial function methods. The weighted residual 


© 


© 


© 


© 




“book” — 2005/9/30 — 15:44 — page 262 — #274 


262 


THE DYNAMICS OF DEFORMABLE OBJECTS 


method gives approximate solutions everywhere in the domain. In earlier times, the weighted residual 
method was solved by hand but the invention of the Finite Element Method (FEM ) made a computer 
implementation possible. The variational form is discretized using the Raleigh-Ritz method , which is a 
special subclass of the Galerkin weighted residual method. Originally, the finite element method was 
developed in this way. 

To illustrate the different terminology introduced above, consider the following simple example. Let 
y = y(x) be a function satisfying the following ordinary differential equation 

y" = y + 2, 0 < rc < 2. (II.l) 


This is a strong form since the ordinary differential equation (ODE ) is to be satisfied at each point in the 
problem domain x G [0,2]. Equation (II.l) alone is not sufficient to specify a solution. It needs to be 
paired with a set of boundary conditions. Two examples arc 

2/(0) = 1, 2/(2) = 4, (II.2a) 

2/(0) = 1, 2/(0) = 0. (IL2b) 


Equation (II. 1) together with the boundary conditions specified in (II.2a) defines a boundary value problem 
(BVP ), which typically models problems in space. Equation (II. 1) together with the boundary conditions 
specified in (II.2b) defines an initial value problem (IVP ), which typically models problems in time. 

Consider the boundary value problem consisting of (II. 1)) and (II.2a). This is the top box in Figure II.3. 
The residual is defined as r(x) = y"—y—2. The strong form in (II.l) is equivalent to r(x) = 0. V.x G [0,2]. 
The residuals for the boundary conditions can be written as r 0 = 2/(0) — 1 and r 2 = 2/(2) — 4. 

Now, multiply the residual r(x) by a weighting function w(x) and integrate over the domain [0,2], 
multiply ro, by weights wq,W 2 , and add them to get 

r(x)w(x)dx + xqWq + r 2 U )2 = 0. (11.3) 

The problem is now on weighted integral form, which is a weak form. The reason for calling it weak is 
that functions other than (II.l) with boundary conditions (II.2a) may satisfy the weak form in (II.3). This 
is shown in the top paid of the middle box of Figure II.3. 

If the weight functions w,wq,W 2 arc written as variations of functions v, vq, ry. the weak form in 
(II. 3) is converted to the variational statement 

f 2 

/ r(x)5v(x)dx + vq5vq + r 2 $V 2 = 0, (11.4) 



where 6 is the variation symbol. The functions v arc called test functions. This is shown in the bottom 
part of the middle box in Figure II.3. 

For this example, the inverse problem of variational calculus has a solution. The functional 


J [y] = ^ \ (y"f - \y 2 + ‘*y 


dx, 


(II. 5) 


restricted to y’ s that satisfies the boundary value problem is stationary in a variational calculus sense. 
Equation (II.l) is called the Euler-Lagrange equation of the functional in (II.5). This is shown in the 
bottom box of Figure II.3, which is now completely described. 
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Figure II.3: All the forms used in the simple example. 
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Particle Systems 


Particles arc the easiest objects to animate. They arc objects that have position, velocity, and mass, but 
no spatial extend, i.e., neither moment of inertia nor torque. Particles have been used to animate a large 
variety of behaviors, such as gases, water (see Chapter 11), fire, rubber, clothes, flocking, rigid bodies, 
etc. Several sections in this chapter arc given in the spirit of [Baraff et al., 2003a, Bara IT et ah, 1998]. 


8.1 Newtonian Particle Systems 

Newtonian particles arc the most common and arc governed by Newton \s second law 

f = mr r = f /m, (8.1) 

where r(t ) = [x\,X 2 ,xf\ T is the position of the particle at time t, and r = [xi,X 2 ,xs] 7 = d is the 
instantaneous acceleration of the particle. The equation, r = f /m, contains a second-order derivative, 
hence it is a second-order differential equation. First-order equations are the simplest to solve, and any 
lineal - higher-order differential equations can be converted to a system of coupled first-order differential 
equations. 

Newton’s second law (8.1) is converted into two coupled first-order differential equations by projecting 
the particles into phase space where a point in phase space is denoted by its position and velocity: 



(8.2) 


The velocity is naturally found as the instantaneous change of position, r = v = and when r 6 K 3 
as typical in 3D computer graphics, then v e M 3 , and hence p 6 M 6 . 

The second-order differential equation is changed into two first-order differential equations, one for r 
and one for v, which are coupled through r = v. The instantaneous change is thus 


V 


r 


r 

a 


V 


r 


(8.3) 


In a simple explicit solution of this system of first-order differential equations, the velocity at time t is 
used to update the particle position for time t + h, while the acceleration at time t is used to update the 
particle velocity for time t + h. Hence, the effect of setting forces at time t will first have an influence on 
the particle’s position at time t + 2 h. This will be discussed in detail in the following sections. 
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8.2 Solving Ordinary Differential Equations 

For a system of particles, we assume that the forces arc given for all relevant interactions and positions, 
therefore we know the accelerations acting on any particle at any time. We will also assume that the initial 
positions and velocities for all particles arc known. The problem is now an initial value problem on the 
form p(t) = F(p(7)) under the condition p(/o) = Po = \ r o- ro] T , where we wish to solve for some 
t > 0. Most initial value problems in particle animation do not have a closed form solution; instead they 
must be solved by discrete approximations, such as Euler’s explicit or implicit scheme. These techniques 
arc discussed in detail in Chapter 23. The simplest method for solving first-order differential equations is 
by explicit Euler integration ; this is the only method that we will discuss here. 

Assume that time is discretized as follows: 


ti+l — h + ti. 


(8.4) 


A second-order accurate approximation of the position at time t l+ \ is given by the first-order Taylor Series 


p{U+i) = p(U) + hp(ti) + <D(h 2 ). 


Since we know the initial value of p(/o) and since we know F(p(/)) 
approximate p(f \) as 

P(*i) - p(*o) + /iF(p(t 0 )), 


which is equivalent to 


n 


ro 

+ h 

ro 

ri 


jo_ 



The following steps arc thus calculated as 


Vi+l 


ri 

+ h 

ri 

1 _ 


r t _ 



(8.5) 

[: x,f(t)/m\ T , then we may 

(8.6) 

(8.7) 


( 8 . 8 ) 


It should be noted that there is a time difference between the solutions of the position and velocity differ¬ 
ential equation, since the velocity calculated at time t j is used to update the position of the particle at time 
ti- |_i- This is the price paid for solving higher-order differential equations as a system of coupled first-order 
differential equations, and for animation purposes it is usually not a problem. Explicit Euler solutions to 
ordinary differential equations arc very simple to implement, and it is very fast to compute each iteration. 
However, they arc neither stable with respect to large steps, h, nor very accurate, which is demonstrated 
in Figure 8.1. Figure 8.1(a) shows the iso-energy values of a rotational symmetric system. Assuming that 
the speed is initially zero, then the correct solution would be the particular' circle that the particle was 
positioned on initially. However, the second-order Taylor approximation extrapolates the solution with a 
finite step size, which implies that the particle will leave whichever circle it is currently on. Instability 
is demonstrated in Figure 8.1(b). Again, assuming that the initial speed is zero, then the correct solution 
is the iso-energy lines shown in the figure and the correct solution is the convergence of the particle on 
the center horizontal line. However, if the step size is set too high, then the particle will diverge in this 
particular example, since the Taylor approximation will extrapolate to positions with increasing forces. 
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(a) (b) 

Figure 8.1: Euler’s method is neither accurate (a) nor stable (b). 


The phase-space solution to the second-order differential equation simultaneously solves for position 
and velocity. Using first-order Euler integration. 


r(t + h ) 


r(t) 

+ h 

'v(t) 

v(t + h) 


v (t)_ 

a {t)_ 


+ 0(h 2 ), 


Verlet [Verlet, 1967] suggested simplifying using the approximation. 


r(t — h) = r(t) — hv(t ) + ^ h 2 a(t ) + 0(h 3 ) 

=>■(«) = rW ~^~*) + 5M0 + o(* 3 ). 


which may be inserted into a second-order integration 

r(t + h) = r{t) + hv(t) + i h 2 a(t ) + 0(h 3 ) 

= —r(t — h ) + 2 r(t) + h 2 a(t ) + 0(h 3 ). 


(8.9a) 

(8.9b) 


(8.10a) 

(8.10b) 


This is called Verlet integration. A consequence of the scheme is that velocity is represented implicitly by 
the previous and current positions. 

An important paid in physics-based animation is collision detection and response. A particularly 
simple solution exists for Verlet integration, since Verlet integration does not store velocities. Hence, when 
a particle is found to penetrate an obstacle, then a simple collision response projects out of the obstacle 
along the shortest path. The cancellation of the velocity normal to the obstacle surface is thus automatic 
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Figure 8.2: A particle trajectory colliding with a wall. When a particle moves into a wall, it is projected 
the shortest distance out again. 


in Verlet integration as illustrated in Figure 8.2! The idea was introduced to the graphics community 
by [Provot, 1995] and discussed in [Baraff et al., 1998]. Later [Wagenaar, 2001] extended the method, 
which was put to use in [Jakobsen, 2001]. In the graphics community, the idea was extended into the 
impulse-based domain of cloth simulation by [Bridson et al., 2002]. With some freedom of interpretation 
you could even claim that the shock propagation from [Guendelman et al., 2003] is related to this way of 
thinking about constraints. 


8.3 Hooke’s Spring Law 

The motion of two particles connected with a spring is governed by Hooke’s Spring Law (Robert Hooke, 
1635-1703), and it is given as 


rriiXi = k s ( Xj — Xi) and rrijXj = —k s ( Xj — Xi) . (8.11) 

where k s is the spring coefficient, and where we have aligned the x-axis of the coordinate system along 
the spring. This may be reformulated as 



“book” — 2005/9/30 — 15:44 — page 269 — #281 


8.3 PHYSICS-BASED ANIMATION 


269 


using X — X j X 2 and p, = and is called the reduced mass. Hence, Hooke’s undamped spring law 

is essentially a one-object harmonic oscillator harmonic oscillator as discussed in Section 22.4. A one- 
object harmonic oscillator, such as a single particle connected to a wall with a spring may be described as 


/spring — k s X. (8.13) 

Newton’s second law of motion of a particle with mass m is then 

mx = — k s x =>• mx + k s x = 0, (8.14) 

and the analytical solution may be found to be 

x = B sin (ujq t) + C cos (tuo t) (8.15a) 

= A cos (coot + (f >). (8.15b) 

The natural frequency is given as, (Jq = B and C are constants determined by initial conditions , 

and <f> is the corresponding phase. 

The harmonic oscillator will continue its movement indefinitely unless it is interfered with by other 
objects. This is unrealistic, since in the real world, springs lose energy due to the production of heat 
through the movement itself and air friction. This dissipation of energy is often modeled using a damping 
viscosity force 

which implies that 


for which the analytical solution is 


using 


F lamping — 

-k d x, 

(8.16) 

mx = - 

-k s x — 

k d x 

(8.17a) 

=>■ mx 

+ k d x 

T k s x = 0 

(8.17b) 


kd . 

k s 

(8.17c) 

=> x + 

—x + 

—x = 0, 


m 

m 


= A exp ^ 

-i‘) 

COS (bJ\t + f ), 

(8.18) 

k 

—, 7 = 

m 

b 

m' 

= \/ w o - X’ 

(8.19) 


and A and f arc constants determined from initial conditions. Solutions arc only real valued when c/ — 

2 

if > 0. Writing the damped harmonic oscillator as 

x = A ( t ) cos (uj\t + <f) = A exp cos (uit + <jj) (8.20) 

and comparing to the undamped harmonic oscillator (8.15b), we see that 

• the amplitude of the damped oscillator is exponential decaying. 

• the frequency lo\ of the damped oscillator is less than the frequency, ccq, of the undamped oscillator. 
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Figure 8.3: An energy function and the corresponding potential field. The positive slope acts as an attractor 
and the negative slope is a repellent. 


8.4 Particle Forces 


The behavior of particles is controlled through the forces f. Typical forces arc potential forces, which 
arc gradients of energy functions. An example of an energy function and its potential field is shown in 
Figure 8.4. If no other forces arc acting on the particle, then it will follow curves, where the potential field 
on every point on the curve is tangent to the curve. 

Particle systems arc conceptually simple; there arc many systems, that with clever applications, make 
use of only a few types of force fields. These arc usually grouped into the following categories: unary 
forces, which act independently on each particle, for example, 


Gravity 


/ 


0 

~9 , 
0 


( 8 . 21 ) 


Viscous Drag 


/ = -k d r, 


( 8 . 22 ) 


Air Drag 


f = -k d \\r\\ 2 r, 


(8.23) 


where the second coordinate is assumed to be the vertical direction, g is the gravitational strength, and k d 
is either the viscosity or air friction coefficients'. N-ary forces which arc the interaction between groups of 
particles, for example, 









Figure 8.4: An example of a system of particles, where two particles, a and b, arc connected to each other 
and immobile points of a wall at c and d. All forces arc spring or gravity forces. 

Hooke’s Law for Damped Spring 

fa = ~fb = ~ MW ~ R) + jjp (8.24) 

where a and b are two particles connected with a spring, k s is the spring constant, 1 = r a — r& is the vector 
connecting the two particles, 1 = v a — Vb is the vector of instantaneous change, and R is the length of 
the spring at rest. The final category is spatial interactions, which arc interactions on single or groups of 
particles that depend on their spatial position. Some of these forces arc illustrated in Figure 8.4 

8.4.1 Deformable Surfaces 

Classically, cloth was modeled by using mass-spring systems [Provot, 1995]. These arc essentially particle 
systems with a set of predefined springs between pairs of particles. The cloth is modeled as a regular 2D 
grid where the grid nodes correspond to particles. The cloth can be animated by moving the particles, 
which will deform the regular grid. 

Springs arc not created at random between the cloth particles, but a certain structure is used to mimic 
cloth properties. Three types of springs arc used: structural springs, shearing springs, and bending springs. 
These spring types arc illustrated in Figure 8.5. 

Structural springs arc created between each pair of horizontal and vertical particle neighbors in the 
regular 2D springs. The purpose of structural springs is to resist st re telling and compression of the cloth. 
This kind of spring is usually initialized with very high spring coefficients and with a rest length equal to 
the interparticle distance. 
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Figure 8.5: Structural, shearing, and bending springs in a cloth mesh. 


Shearing springs arc used to make sure that the cloth does not shear, i.e., a square piece of cloth 
should stay squared and not turn into an oblong diamond shape. The spring coefficient of these springs is 
usually set less than the structural springs. Shearing springs are created between diagonal direction in the 
regular grid. Thus a particle at grid location (i, j) will have shearing springs to particles at grid locations: 

Bending springs arc inserted to make sure the cloth will not have sharp ridges when folded. These 
springs arc usually not very strong, since natural cloth does not strongly resist bending. The springs 
arc created along the horizontal and vertical grid lines, but only between every second particle. That is, 
particles at grid location i,j arc connected with bending springs to particles at grid locations: (i + 2, j), 

(* - 2, j), (i,j + 2), and (i,j - 2). 

It is straightforward to create these springs for a rectilineal - grid of particles and it offers an easy way 
to model a rectangular piece of cloth. Spring coefficients can be tuned to model different kinds of cloth 
types: cotton, polyester, etc., as discussed in [House et al., 2000]. 

One deficiency with modeling cloth this way is that it is limited to rectangular pieces of cloth. This is 
very limiting when, for instance, modeling animation cloth on an animated person. Nevertheless, we will 
focus on a generalization made in [OpenTissue, 2005], which can be applied to unstructured meshes such 
as arbitrary triangle meshes. 

The generalization is based on the observation that structural springs correspond to a 1-neighborhood 
stencil in the rectilineal - grid; shearing and bending springs correspond to a 2-neighborhood stencil. Simi¬ 
lar neighborhoods can be defined in an unstructured mesh. We let the vertices of the mesh correspond to 
particles, the cost of every edge in the mesh is set to 1, and for each particle the shortest paths to all other 
particles is computed. If we want to create 1-neighborhood-type springs, then springs are created to all 
particles having a cost less than or equal to 1. If we also want to add 2-neighborhood-type springs, then 
springs are created to all particles with a cost less than or equal to 2. 3-neighborhood springs and so on 
are created in a similar fashion. Figure 8.6 illustrate the springs created for a single chosen vertex. 

Generally speaking, the larger the neighborhood size used, the more rigid the resulting object appeal's 
to be. In fact, if the neighborhood size is large enough, springs will be created between every pair of 
particles in the mesh, and if the spring constants also are large, then the object will appeal - completely 
rigid. Thus we refer to the neighborhood size as the rigidity factor. The spring creation method was 
named surface springs in [OpenTissue, 2005]. 
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(a) 0 neighborhood 


(b) 1 neighborhood 


(c) 2 neighborhood 


Figure 8.6: Surface springs (thick lines) created for a chosen mesh vertex. 


The path cost computation can be implemented elegantly by performing a breadth-first traversal on 
each particle in the mesh. Once the traversal reaches particles lying further away than the rigidity factor, 
the traversal is stopped. Figure 8.7 illustrates a pseudocode implementation of the breadth-first traversal 
algorithm for creating surface springs. Observe that creating a rectangular mesh and applying the surface 
spring creation method with a rigidity factor of two, will yield the traditional cloth mesh. In Figure 8.8 arc 
examples of a cow with rigidity factors 1-3, and in Figures 8.9-8.11 is the animation when the cow falls 
under gravity. 

8.4.2 Deformable Solids 

The spring modeling ideas introduced in the previous section can be extended from surfaces to solids. For 
instance, given a 3D rectilineal - grid, structural, shearing, and bending springs can be extended straightfor¬ 
wardly to the 3D case. However, one needs to consider spatial diagonals as well as shearing diagonals. 
Shearing diagonals will prevent area loss; spatial diagonals will counteract volume loss. The general¬ 
ization to unstructured solid meshes is also easily performed with the so-called volume mesh , such as a 
tetrahedral mesh. The algorithm shown in Figure 8.7 can be applied unchanged to this type of mesh. In 
Figure 8.12 is the cow simulation for an unstructured mesh. 

Unfortunately, in practice a rectilineal - grid or a volume mesh of the solid is not always available, but 
instead a 2D surface mesh of the object is given. To make a solid mesh, a voxelization of the interior of the 
surface could be performed and the resulting voxels could be used as a basis for creating a rectilineal - grid 
structure resembling the solid object. To keep particle and spring numbers as low as possible, a coarse 
voxelization is often applied. The surface mesh can then be coupled to the grid-structure by initially 
mapping each vertex into the grid-cube containing it. During animation, the vertex position can be updated 
when the rectilineal - grid deforms by using trilinear interpolation of the deformed positions of the grid 
nodes of the containing grid cube. The mesh coupling idea is a well-known technique for deformable 
objects [M.Muller, 2004b, M.Miiller, 2004a]. 
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algorithm create-surface-springs(rigidity, mesh) 
for each vertex V of mesh 
S = empty set of vertices 
traverse(V, V, 0, S) 
for vertices in S unmark them 
next V 

end algorithm 

algorithm traverse(V,A,distance,S) 
if distance >= rigidity then 
return 

S = S union A 
mark A 

for each edge E of A do 

let B be other vertex of E 
if B not marked then 

if not exist spring (A,B) then 
create spring(A,B) 
end if 

traverse(V,B,distance+1,S) 
end if 
next E 

end algorithm 


Figure 8.7: Pseudocode for breadth-first traversal creation of springs in a mesh. 


In [Jansson et al., 2002b, Jansson et al., 2002a] it is shown how spring models can be modeled based 
on rectilineal - grids. Furthermore, a mesh-coupling technique is adapted, although in their approach only 
structural springs are initially created. During animation it is tested whether particles come too close, 
i.e., whether the interdistance between particles drops below a user-specified threshold. In that case, a 
spring is created dynamically on the fly and kept active until the interparticle distance is increased beyond 
a spring-breaking threshold distance specified by the end user. 

A solid model can be created from the surface mesh by clever insertion of springs. This approach has 
been explored in [Mosegaard, 2003], where a cone-based spring creation method was presented. The idea 
herein is readily generalized to create volume preserving springs for surface meshes. Such a generalized 
approach will be presented in the following. 

Given an unstructured surface mesh, we let the vertices denote particles. Initially angle-weighted 
vertex normals [Aanass et al., 2003] are computed. These normals are favored, since they are independent 
of the tessellation of the mesh. For each vertex a cone is created, the cone will have apex at the vertex 
position, and the angle-weighted vertex normal will denote the cone axis. Next, all other vertices are tested 
for inclusion inside the vertex cone. For each vertex found inside the cone, a spring is created between the 
apex vertex and the contained vertex. The idea is illustrated in 2D in Figure 8.13. 

The cone angle works as a rigidity factor in very much the same way as the neighborhood size used 
previously. The larger the angle, the more springs will be created, and the surface mesh will have more 
resistance toward collapsing. The creation method is independent of the specific tessellation. Even meshes 
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Figure 8.8: A cow setup with rigidity factor equal to 1-3 from top to bottom. Simulations arc shown in 
Figures 8.9-8.11 respectively. 


with faulty topology can be handled, since only vertex positions arc used. 

One drawback with this spring creation method is that if the cone angle is too small, it may occur that 
no vertices arc found to lie inside the cone angles, and an insufficient number of springs will be created. 
One possible remedy of this problem is to make the cone angle adaptive; for instance, dependent on the 
local surface curvature, i.e., the angle is inverse reciprocal to the largest principal curvature or the like. 

A computational cheap alternative to the spring cone creation method is to use an inward mesh extru¬ 
sion instead. Here an inward mesh extrusion is performed, for instance, by shrinking the original mesh 
by moving vertices inward along vertex normal directions. It is advantageous to use a vertex normal 
definition that determines normals independently of mesh tessellation [Aanses et al., 2003]; the extrusion 
lengths can be computed to ensure non-self-intersections as done in [Erleben et al., 2004]. 

Having obtained the extruded mesh surface, it is now a simple matter to create springs based on mesh 
connectivity 

and pairing extruded vertices with their originals. This is illustrated in Figure 8.14, where it can be 
seen how to create structural and shearing springs. 
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Figure 8.9: A cow being deformed under gravity with rigidity factor equal to 1. Compare with Figures 8.10 
and 8.11 


Compared with the other methods for solids, this method has the advantage of being simple to im¬ 
plement and very fast. Furthermore, since only a minimum of internal particles arc created (vertices of 
extruded surface), the method has a performance advantage compared to the voxelization approach de¬ 
scribed previously. 

It should be noted that a voxelization approach often requires a watertight surface mesh; otherwise the 
voxelization may be error prone. Methods do exist [Nooruddin et al., 2003] using layered depth images 
(LDI ) for creating binary images that can be used for repairing a polygonal mesh or moving least squares 
approaches [Shen et al., 2004], which creates implicit surface models. 

It may be more attractive to be able to work directly on a degenerate mesh , i.e., one with topological 
errors, such as edges having more than two incident neighboring faces. Especially, graphics artists or 
content creators are famous for producing meshes, which arc a very poor starting point for creating a 
model for a physical simulation. Thus, to circumvent these problems, one approach is to accept life as it 
is, and simply use methods that arc robust and stable toward artistic errors in visual content. The cone- 
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Figure 8.10: A cow being deformed under gravity with rigidity factor equal to 2. Compare with Figures 8.9 
and 8.11 
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Figure 8.11: A cow being deformed under gravity with rigidity factor equal to 3. Compare with Figures 8.9 
and 8.10 
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Figure 8.12: Simulation using springs created from cubes in a voxelization of a cow mesh. 


based spring creation method and the breadth-first traversal spring creation methods can both be used on 
such faulty meshes. Whereas the other spring creation methods often rely on a perfect error-free mesh, 
sometimes even a watertight mesh. 

8.4.3 Deformable Plants 

More organic-like shapes such as plants and trees arc also easily modeled by particle systems with clever 
spring creations. 

A simple straw of grass may be modeled as a string of particles like pearls on a string. Structural 
springs and bending springs arc inserted to provide the string with stiffness as shown in Figure 8.15. 
The same principle can be applied to model ropes, grass, and hair. If more stiffness is wanted, more 
springs can be created in a similar fashion to using 3- and 4-neighborhoods etc. Another possibility is to 
introduce angular springs. Here a rest angle, (9 rest , is assigned between the two lines running through three 
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normal 



Figure 8.13: Angle-weighted cones used for creating springs, which add internal structure to a surface 
mesh. 


consecutive neighboring particles. During simulation, the current angle, 9 , is computed and an energy 
function can be designed as E = — ^k(9 — 9 mst ) 2 , where k is a kind of spring coefficient. The negative 
gradient of the energy function yields a restoring penalty torque that can be applied to the particles. In 
Section 8.5, we will show how energy functions can be designed in a general way and used to add forces 
to the particles. 

If more complex structures arc needed, branches can be added to the model to give a treelike effect. 
Flowever, extra springs must be inserted between the branches and the tree trunk. This schematic is shown 
in Figure 8.16. In order to appeal - realistic in areal application, the above particle models must be rendered 
properly. For the treelike model, a mesh coupling similar to traditional skin-deformation can by used to 
model the geometry of the trunk and branches. Leaves and finer structures are more easily handled by 



(a) Original Surface 
Mesh 





(d) Structural and 
Shearing Springs 


Figure 8.14: Spring creation based on inward mesh extrusion. Springs are shown as thickened lines. 
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Structural Bending Grass 

Springs Springs Straw 


Figure 8.15: A simple grass straw mass-spring model. 



Figure 8.16: A simple tree mass-spring model. Notice springs between branches and trunk. 


adding small texture maps (also known as billboards) attached to particles. 


8.4.4 Pressure Forces 

Pressure forces arc based on traditional thermodynamics, and they can by used to model balloonlike ob¬ 
jects, which arc often called pressure soft bodies [Matyka et al., 2003]. Such objects allow for interesting 
effects such as inflation, or creating fluffy looking deformable objects, as shown in Figure 8.18. 

Initially, a particle system based on a watertight two-manifold is set up; for instance, by creating 
surface springs using the method for unstructured meshes outlined in Section 8.4.1. 

From physics, it is given that the norm of a pressure force F acting on a surface is given as the 
pressure, P, divided by the area, A, of the surface, that is, 



P 

A' 


(8.25) 
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The direction is given by the normal, n , of the surface, that is, F = ( P/A)n. 

We will assume that the pressure inside the object is evenly distributed, which is a good assumption 
for systems in equilibrium. The assumption implies that each face on the surface of the unstructured mesh 
is subject to the same pressure. Thus, if the area of the i’th face is ,4 t . then the magnitude of the force 
acting on that face is given by P/Ai. However, our particles correspond to the vertices of the mesh, so 
in order to distribute the force acting on the face to the vertices, we assume that the face force is evenly 
distributed to each vertex of the face. Thus, for C, vertices on the face, for the j’th vertex of the face the 
force f ■ is added 

fj = n i~Tr r '> ( 8 - 26 ) 

where nj is the normal of the vertex. Of course, this is not entirely physically accurate, but for fine 
tessellated objects with good aspect ratio, i.e., no slivers or obscured triangles, this is a fairly good ap¬ 
proximation. In practice the error does not affect the aesthetics of the motion, and a plausible balloonlike 
effect is easily achieved. 

The only problem with the approach so far is that the value of P is unknown. Letting P be user 
specified allows the user to control the pressure forces by altering the value of P directly. But such an 
approach cannot model the effect of an increasing internal pressure as an object gets squeezed, nor can it 
model the falling internal pressure of an extended object. This implies that we need to consider the change 
in volume of the object. Fortunately, the Ideal Gas Law 

PV = nRT, (8.27) 


may be used for just that. The constant R is known as the gas constant, n is the number of molar of gas 
proportional to the number of molecules of gas, T is the temperature, and V is the volume of the gas. The 
ideal gas law can be used to compute the internal pressure inside the object: first, the current volume of 
the object is computed and then the pressure is found as 


P = 


nRT 

V 


(8.28) 


For animation purposes it is impractical to specify n and R; instead it is easier to specify an initial internal 
pressure Pj n j t j a i, and then compute 


CnRT — -/'initial initial > 


(8.29) 


where initial is th e initial volume. Having computed the constant c„rt, the pressure can now be computed 
as 


P = 


CnRT 

V 


(8.30) 


This approach also allows an easy way to interact with the pressure forces. Letting the user change /-(ninai 
over the course of animation, and then recomputing the constant c„rt yields an easy and intuitive way of 
controlling inflation and deflation of the object. 

Figure 8.17 illustrates the pressure force computations in pseudocode. 
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algorithm apply-pressure-force(mesh) 
V = volume(mesh) 

P = nRT / V 
update(normals(mesh)) 
for each face F in mesh do 
C = count(vertices(F)) 

A = area(F) 
f = P/ A C 

for each vertex v in F do 
let n be normal of v 
force(v) = force(v) + f*n 
next v 
next F 

end algorithm 


Figure 8.17: Pressure force computations. 



Figure 8.18: Cow falling under gravity while being inflated. 
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8.4.5 Force Fields 

The last type of force example we will present is based on interpolation. A flow field of either forces 
or velocities can be computed offline, and sampled on some grid structure or other spatial arrangement. 
Particles arc then let loose, and during simulation their spatial positions arc used to look up their position 
in a grid arrangement. Hereafter, the flow-field values arc interpolated and applied to the particle. This 
was used in [Rasmussen et ah, 2003] to create large scale effects, such as nuclear explosions in a movie 
production. 

8.4.6 Spline-Trajectory Controlled Particle System 

In some cases, one wants to plan how force fields lie in space to give an overall control of the particle 
trajectories. Imagine some kind of animation effect of smoke or fire traveling though a building; that is, 
going through one door, turning around a corner, moving along the ceiling, and out through another door. 
We have a general idea of the average particle trajectory, which is a completely planned motion. The fine 
details and perhaps turbulent behavior of the particulates should be dynamically simulated as the particles 
move along the trajectory. This is feasible because if the trajectory goes through some obstacle, such as a 
chair or table, then we want particles to interact in a physically plausible way with the obstacles but still 
keep on track. 

One possible solution is to use a huge force field sampled on a regular grid, wherein the entire con¬ 
figuration, i.e., the whole building, is contained. However, this is not a good choice: first, it is infeasible, 
since it will require a huge amount of memory to store the samples. Second, it would be computation¬ 
ally disadvantageous to initialize the force field with values, such that particles would follow a predefined 
trajectory. Finally, if an animator wants to change the trajectory to fine tune the effect, it would require 
recomputation of the entire force field. In conclusion, the force field solution is slow and cumbersome to 
work with in this case. 

Another solution is to insert attraction and repulsive force field on some algebraic form to control the 
particle trajectory. However, in practice it requires a lot of tuning to receive the wanted effect, there is no 
explicit control, and the particles may completely escape. 

The technique we will present next provides a tool for controlling the trajectory of a particle system 
in the way we just described. The method is inspired by the work in [Rasmussen et ah, 2003]. 

Let the trajectory be defined by a B-spline as described in Chapter 24 (other spline types could be used, 
but we favor B-splines). Then the animator can easily control and change the trajectory by manipulating 
the control points of the spline. 

Given a spline parameter u, one can compute the corresponding world point piu) by 

n 

p( u ) = '^2 N i,K(u)P i (8.31) 

i =0 

where Njj< is the i’th basis function of K 'th order and P, is the i’th control point. There is a total of 
n + 1 control points. The evaluation of the spline point shown here is not very efficient but it is sufficient 
to give intuition behind the described method. Efficient spline implementation is outlined in Chapter 24. 
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For simplicity, we will assume that the spline is a sufficient regular spline. That is, we assume that it 
has no cusps and that the first- and second-order derivatives, the tangent and normal vectors, arc always 
nonzero. It is thus possible to assign an intrinsic coordinate frame to each point, u, along the spline in a 
unique and well-defined way. First, we compute the tangent vector, t(u), by straightforward differentiation 
of (8.31), thus 

t{u) 

The normal vector n{u) is then given by 

n(u ) 


dNij<(u) p 


i =0 


du 


d 2 Ni t x(u) p 


i= 0 


du 2 


(8.32) 


(8.33) 


The cross product yields the binomial vector b(u). The vectors t(u), n(u), and b(u), arc generally not unit 
vectors unless the spline-parameter u is identical to the arc length. Therefore, we perform a normalization 
of the three vectors. We now have three linear-independent vectors, all orthogonal to each other; thus 
normalization yields an orthonormal basis, 


F{u) 


t(u) n(u ) b(u) 

Jl*(«)ll 2 M«)ll 2 ll b (M)ll 2 

[x(u) y{u) z{u)\ 


(8.34) 

(8.35) 


where we have introduced the shorthand notation x(u), y(u), and z(u ) for the unit axis coordinate vectors. 
The coordinate frame F is also known as the Frenet frame. The spline can thus be seen as a rotational 
warp of the initial Frenet frame F(0). At time point u, along the spline the frame will have orientation 
F(u) and origin will have been translated to p(u). The particles will live in this strange coordinate system 
whenever the forces applied to the particles need to be evaluated. 

Say a particle is located at some world coordinate position r wcs (t), where t denotes the current time. 
The closest point on the spline is found, p(u), and the world coordinate position is transformed into the 
warped spline coordinate frame by the coordinate transform 

^spline(/) = F(u) T (r wcs (f) - p(u).) (8.36) 


Notice that if p(u) is truly the closest point on the spline, then r sp | me (/) should have a component along 
the i-axis of F(u). 

We place 2D grids at each distinct knot-value of the spline. This is equivalent to placing the 2D grid 
at the data points of the spline. This is illustrated in 2D in Figure 8.19. The grids arc placed such that the 
spline goes through the center of the grid. The . 7 ;-axis and y-axis of the grid arc aligned with the ro-axis 
and 6-axis of the Frenet frame at the spline point where the grid is located. The general idea is to find 
the two 2D grids, where the particle lies in between, as shown in Figure 8.20. It is a rather trivial task to 
find these grids, once the closest point on the spline has been determined. If the closest point corresponds 
to spline-parameter value u. then the grid lying before is located at the largest knot-value u _, such that 
U- < u. Similarly, the grid lying after the particle is located at the smallest knot-value u + , where u + > u. 
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t-axis 

A 



Figure 8.21: Illustration showing how particle positions arc mapped onto a 2D grid, using the Frenet frame. 
Observe that the spline goes through the center of the grid. Grid node with indices (0,0) is therefore offset 

by ’’offset- 


Next, we map the particle position onto the two grids using the Frenet frame axes. Figure 8.21 illus¬ 
trates the coordinate mapping process. 

First, we must account for the spline going through the midpoint of the 2D grid, a priori this offset, 
’’offset is known, therefore we get the particle position with respect to the smallest grid node by 


’’grid — ’’spline ’’offset (8.37) 

Hereafter, we project r gnc | onto the normal and binormal axis. If the grid resolution along the axes is given 
by Ay and A z, then the particle is located in the grid cell given by indices 


V = 

z = 


V ' ’’ grid 

Ay _ 

•2 ' V grid 

- Az . 


(8.38) 

(8.39) 


Knowing the cell indices, it is trivial to perform a bilinear interpolation of the values stored in the four 
corner nodes of the cell, which the particle is mapped into. Thus, we now have two grid-interpolated 
values /(w_) and f(u+), the value at the particle position can now be found by a linear interpolation of 
the found grid values, that is, 


/(«) 


u .|_ — u 

U + — U- 


/(«-) + 


U — U- 


U + — U- 


/(«+)■ 


(8.40) 
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The technique can be used to define any kind of field values. In the example we used to derive our 
equations, we simply assumed that a scalar field was stored in the 2D grids placed along the spline. 
However, the formulas arc identical if we were to store a vector field, such as the force field. The 2D grids 
could be placed at other positions than those identified by distinct knot values of the spline. 

8.4.7 Free-Form Deformation Particle System 

Sometimes increased performance can be obtained by decoupling the geometry used for visualization from 
the dynamics model used for simulation. The concept is often called mesh coupling [Muller et al., 2004a] 
or a cartoon mesh [Hauser et al., 2003]. 

Most of the modeling techniques described in previous sections were based on creating springs di¬ 
rectly from a given surface mesh. It is well known that meshes used for visualization have a large number 
of triangular faces—millions of faces arc not unheard of. Creating springs for a particle system resem¬ 
bling such a large mesh will yield a dynamic model that is computationally intractable for real time and 
even interactive usage. Fortunately, it is possible to render such large meshes very efficiently using graph¬ 
ics hardware. Therefore, it is a viable approach to create a low-resolution dynamics/physical model for 
computing deformations in a physical plausible way, and then apply the computed deformation to the 
high-resolution mesh, which should be used for visualization. Another benefit from mesh coupling is that 
the problem of error-prone meshes mentioned earlier can be circumvented rather elegantly, as we will 
illustrate in the following. 

We will demonstrate a method based on free-form deformations ( FFD ). Free-form deformation is one 
of the early techniques in computer graphics [Sederberg et al., 1986] for animating deformations. Here 
we will briefly present the general idea. 

Initially we will set up a B-spline tensor, i.e., a 3D array of control points, where P t j t k denotes the 
( i,j , k )'th element of the array, and the B-spline basis functions Nj x(t), and Aq.ft-(?./.), where 

K is the order of the spline, and the indices i, j, and k indicate the beginning of the knot-interval where 
the spline is evaluated (is dependent on). See Appendix 24 for an introduction to B-splines. 

Then the point, p, corresponding to the spline parameters s, t, and u in the spline tensor is computed 
as 

n i Hi 

p(s,t,u ) = y J2 N i,K(s)N j j < (t)N k)K (u)P i j : k, (8.41) 

k= 0 j= 0 i =0 

where n, + 1, rij + 1, and /q. -f 1 is the number of control points along the i-, j-, and k- axes. Exploiting 
the fact that the basis function is only evaluated on a finite support of the knot vector leads to efficient 
implementations (see [Hoschek et al., 1993, Piegl et al., 1995] for details). In the case of B-splines, the 
basis functions arc polynomials of order K, (8.41) and arc simply a weighted sum of vectors, where the 
weights arc given by polynomials. 

The control point array is set up so that all control points lie in a 3D rectangular grid, axes aligned 
with the world coordinate frame. It is trivial to see that in case of B-splines, keeping two parameters fixed 
while varying the last yields straight orthogonal lines aligned with the world coordinate system. 

Picking a uniform knot vector in this initial position, the world coordinate system and the spline 
parameter space arc in perfect alignment, except for a shifting, given by the value of the control point 
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Algorithm init(Mesh mesh) 

P = new vector3[n_i+l][n_j+l][n_k+l] 
p_ min = vector3(-infty,-infty,-infty) 
p_max = - p_min 
for each vertex v in mesh 

p_min = min(p_min, v.position) 
p_max = max(p_max, v.position) 
next v 

p_min = p_min - vector3(border,border,border) 

di = (p_max_i - p_min_i) / n_i 

dj = (p_max_j - p_min_j) / n_j 

dk = (p_max_k - p_min_k) / n_k 

P [0] [0] [0] = p_min 
for(int k=0;k<=n_k;++k) 
for(int j=0;j<=n_j;++j) 
for(int i=0;i<=n_i;++i) 

P [ i] t j1 [k] = P[0] [0] [0] + vector3(i*di,j*dj,k*dk); 
for each vertex v in mesh 

v.parameter = v.position - P[0][0][0] 
next v 

End algorithm 


Figure 8.22: Initialization of control point grid and mapping of mesh vertices into spline parameter space. 


with lowest coordinates. This means that mesh vertices arc easily mapped into the spline parameter space 
by storing the original value of the vertex positions minus the minimum coordinate control point. During 
simulation the control point positions arc altered, whereby the spline parameter space is warped. The 
coupled mesh can be reevaluated by using the original vertex position as spline parameters in (8.41). The 
computed position is then used when rendering the mesh. 

Figures 8.22 and 8.23 illustrate the initialization process and rendering process of FFD mesh coupling. 

The figures arc oversimplified to show the main concepts clearly. In practice, any kind of splines and knot 
vector could be used; it will only complicate the initial mapping of mesh vertices into the spline parameter 
space. Also the reevaluation of mesh vertex positions could be implemented far more efficiently than the 
brute-force evaluation we have outlined here. 

We will now explain how to create a dynamic model for simulation using the FFD grid. A particle is 
created at each control point in the array. The array partitions a region of space into cubes, such that the 
cube labeled by ( i,j,k ) consists of particles: ( i,j,k ), (i+l,j,k), (i,j+l,k), (i+l,j+l,k), (i,j,k+ 1), 
(i + 1 ,j,k + 1), (i, j + 1, k + 1), and (* + 1, j + 1, k + 1). When assigning mass value to particle (i, j, k), 
look up the number of vertices from the mesh mapped into the eight neighboring cubes given by the labels 
(*- 1 ,3 - (*- l,j, k), (■ i,j - 1 ,k), ( i,j,k ), - l,k + l), (i — l,j, k + 1), ( i,j - 1, k + l), 

and (i, j,k + 1). The mass of the particle is found by the total number of mesh vertices in the neighboring 
cubes divided by eight and multiplied by some user-specified density value. Pseudocode for calculating the 
particle masses is given in Figure 8.24. Next, structural, shearing, and bending springs arc created between 
the control point particles. The details of such a spring creation method were explained in Section 8.4.2. 
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Algorithm show-mesh(Mesh mesh) 
for each vertex v in mesh 
s = v.parameter.s 
t = v.parameter.t 
u = v.parameter.u 

v.position = sum_k sum_j sum_i N_iK(s) N_jK(t) N_kK(u) P[i,j,k] 
next v 
draw mesh 
end algorithm 


Figure 8.23: Evaluation of vertex positions before drawing coupled mesh. 


algorithm mass-assignment(density) 
for each particle p in system 
i,j,k = index of p 
Cl = cube[i—1,j-1,k] 

C2 = cube[i—1,j,k] 

C3 = cube[i,j-1,k] 

C4 = cube[i,j,k] 

C5 = cube[i-1,j-1,k+1] 

C6 = cube[i-1,j,k+1] 

C7 = cube[i,j-1,k+1] 

C8 = cube[i,j,k+1] 

mass(p) = density* (vertices(Cl)+...+vertex(C8))/8 
next p 

end algorithm 


Figure 8.24: Mass computation of particles 


We now essentially have a physical model, consisting of the control point particles interconnected with 
springs. We can subject the particles to any external forces, whereby the control point particles will move 
and deform the spline parameter space. After the simulation, the coupled mesh vertices are reevaluated 
using the deformed control points, and the coupled mesh is redrawn. 

Essentially we can choose two ways in which the FFD particle system can interact with the surround¬ 
ings: either the collision detection or response is applied directly to the control point particles. 

This is a simple and straightforward solution. It lends itself to a simple particle system implementation 
since there is only a one-way communication with the coupled mesh, i.e., there is no feedback from the 
coupled mesh to the FFD particle system. The coupled mesh is completely dependent on the FFD particles. 
This solution also poses a performance advantage since the count of control point particles is expected to 
be extremely low compared with the number of vertices in the coupled mesh. 

On the downside, the interactions can appeal - quite unrealistic, since the grid of FFD particles has 
nothing in common with the deforming geometry of the coupled mesh, except that we know the convex 
hull of the control points encloses the underlying geometry. 

To create a more plausible interaction with the surroundings, we need two-way communications be- 
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tween the coupled mesh and the FFD particles. This can be accomplished by performing collision detec¬ 
tion between the deforming coupled mesh and the environment. This will generate a set of contact points 
(see Chapter 14 for details on contact determination) between the coupled mesh and some other obstacles 
in the configuration. 

Here we will assume that an arbitrary simulation method is used to compute collision impulses or 
contact forces that should be applied at the contact point in order to prevent penetration, and thereby cause 
a deformation. 

To keep things simple at this point, we will simply perform penetration tests between the coupled 
mesh vertices and other obstacles. This means that a generated contact point will have the same position 
in world space as a deformed mesh vertex. 

We can thus remap the contact point into the spline parameter space by looking up the spline param¬ 
eters of the deformed vertex. The spline parameters will indicate the cube in the FFD grid which the 
deformed mesh vertex was originally mapped into. Thus, we have the idea of performing a reverse inter¬ 
polation for finding equivalent contact forces/impulses to apply at the corner control point particles of that 
cube. 

From Newton’s second law we know forces obey the superposition principle, which means that given 
the contact force, F applied at the deformed vertex position r, then 


F — f i 4 + f% 


(8.42) 


where /., is the force applied to the f th particle. In addition, it seems reasonable to require some balance 
of torque, thus 

rxF = r\Xf 1 -\ - hr 8 x/ 8 (8.43) 

where r., is the current position of the /’th control point particle. These two equations thus define two 
requirements for an inverse interpolation. We need to determine the particle forces. 

Using the skew-symmetric cross product matrix defined in Proposition 18.4, this yields a system of 
equations 



f 

Af = b 


(8.44) 


(8.45) 


Notice that A 6 R 6x24 , f 6 R 24xl , and b 6 R 6xl , thus we have more unknowns than equations, and 
the system is underdetermined In general, this means that we have more than one solution to our problem 
and as a practical consequence A cannot be inverted. To circumvent these difficulties, we can simply 
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algorithm reverse-interpolation(di,dj,dk) 
clear forces 

for each contact point cp 

v = corresponding vertex of cp 
s,t,u = v.parameters 
i,j,k = s/di, t/dj u/dk 

pi = particle[i,j,k] 
p2 = particle[i+1,j,k] 

p8 = particle[i+1,j+1,k+1] 

solve for fl,. . ., f8 

add fl to pi 

add f8 to p8 
next cp 
end algorithm 


Figure 8.25: Applying contact forces from coupled mesh to FFD particles. 


use a singular value decomposition ( SVD ) solution or pseudoinverse of A to obtain the minimum norm 
solution. 

An SVD decomposition of A yields A — UDV 1 , where D is a diagonal square matrix with sin¬ 
gular - values in the diagonal. The matrices U and V are orthogonal in the sense that their columns are 
orthonormal. A minimum norm solution to the problem is then given by 

f = VD 1 ( U T b ) (8.46) 

The solution minimizes the norm of the residual r = Af — b (see [Press et al., 1999b] for more details 
on SVD. Figure 8.25 illustrates the force computation). 

Control-point particle collision impulses j 1 ,... j 8 can be computed in a si mi lar fashion given some 
simulation methods, which can compute the impulse J to apply at the contact points. The main difference 
from the contact force computation is that impulses instantaneously change the velocities at the parti¬ 
cles they are applied at, whereas contact forces are accumulated into a force vector, which is used in a 
numerical integration scheme. 

8.5 Energy Systems 

Forces are the handles for controlling the behavior of a particle system. Many forces are easily adapted 
from physics to a specific animation system, such as gravity, viscosity, and spring forces. Nevertheless, 
animation systems often have other forces that are not based on reality, but are added for convenience. As 
an example, if a particle is not allowed to penetrate an object, then a force field is often applied around the 
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object. For example, a force field around a sphere could be produced by an energy or potential function, 
such as 


E{r) = ||t* - r 0 || 2 - R 

= V( r ~ r o ) ' (r - r 0 ) - R, 


(8.47a) 

(8.47b) 


where ro is the center of the sphere, and R is its radius. The implicit function E is positive outside and 
negative inside the sphere. The gradient of E may be used to define the direction of the repulsive force. 


dE(r) _ 2(r — ro) 

Of 2 y/(r - r 0 ) • (r - r 0 ) 

_ r - r 0 _ 

V(r - r o) ■ 0 - ro) 


(8.48a) 

(8.48b) 


since it always points away from the center of the sphere and has length 1, 


dE(r) 

2 

r - r 0 

dr 

2 

V( r - r 0) ■ (r - r 0 ) 


r — r o r — r o 

y /(r - ro) 7 {r - r 0 ) y/(r - r 0 ) • (r - r 0 ) 
= 1 . 


(8.49a) 

(8.49b) 

(8.49c) 


Having a repulsive force in all of space is typically not useful and the repulsive force is therefore often 
made strong only near the object. This may be done as 


= «-<•*)« ^§41, (8.50) 

where c and cr arc constants to control the strength and the width of the repulsive field. In this example, we 
have used a clamped exponential force, such that the force field diminishes quickly away from the surface, 
and such that the maximum force is limited to avoid stiff ordinary differential equations. An example is 
shown in Figure 8.26. 

As illustrated in the above example, forces may be designed as gradients of energy, potential, or 
penalty functions. Such vector fields arc also known as potential fields. The potential function for Hooke’s 
undamped spring law for two particles at position a and b, is given by 

E ab = ^k s (\l\-R) 2 , (8.51) 
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Clamped Expential Function 



Figure 8.26: An often-used strength for a repulsive force is the clamped exponential function. 


Distance preservation E a i , = ^k s (||r& — r a || — R)~ 

Areapreservation E abc = \k a (||(r 6 - r a ) x (r c - r a )\\ 2 - A) 2 
Volume preservation E abcd = \k a (det([r 6 - r a \r c - r a \r d - r a ]) - V) 2 


Table 8.1: Distance-, area-, and volume-preserving energy systems. 


where, l = r b — r a . This is easily verified by calculating the gradients of E ab with respect to r a : 


f, 


dE ab 


dr 


a 


-k a (\l\-R) 
—k s (\l\ — R) 

k s (\l\~R )|j| 


d(\l\-R) 

dr a 

-2 r b + 2 r a 

m 


(8.52a) 

(8.52b) 

(8.52c) 

(8.52d) 


and with a si mi lar calculation for r b . Hooke’s Law may be viewed as a distance-preserving energy func¬ 
tion. Area and volume-preserving energy functions may be defined in a similar way [Teschner et al., 2004], 
as summarized in Table 8.1. In the table reference, lengths, areas, and volumes arc given as R, A, and V, 
for the corresponding parallelograms. In case the reference areas and volumes arc given as triangular area 
and tetrahedral volume, then the norms should be divided by the factors \ ar| d | respectively. 


© 


© 


© 


© 





© 


“book” — 2005/9/30 — 15:44 — page 295 — #307 


£ 


© 


© 


8.6 PHYSICS-BASED ANIMATION 


295 


8.6 Constraint Dynamics 

Stiffness constraints arc difficult to integrate in Euler methods with springs, since these lead to stiff ordi¬ 
nary differential equations. However, there is a special case when k s —► oo, which may be solved directly. 
Use 


l(t) = r b (t) - r a (t) (8.53a) 

AZ(t) = \l(t)\ — r. (8.53b) 


Since the A l is antisymmetric in r a and r b , solving for A l(t + h) = 0 implies that 


r a (t + h) 
r b (t + h) 


A l l 

T ]7[ 

A l l 

2“|7 


(8.54a) 

(8.54b) 


Iterating over all the stiff springs and applying (8.54) until only minor changes arc performed is termed 
relaxation. Using Verlet integration together with relaxation and projection for nonpenetration is beneficial 
because velocities arc represented implicitly in the Verlet integration method. The implication is that 
when positions arc altered by relaxation or projection, then velocities arc implicitly modified to match the 
enforced constraints [Jakobsen, 2001]. 

In general, hard constraints can be enforced by adding constraint forces. In the following, we will 
discuss a general energy function, which is given as an implicit function E(r), for which the constraints 
arc fulfilled for all positions r' where E(r') = 0. Points on a unit sphere centered at the origin, E{r) = 
i(r • r — 1) is an example. For points on the surface r', we have E(r') = 0, and the velocities ensuring 
that E = 0 are found by requiring. 




(8.55) 


Likewise, accelerations fulfilling the constraint must satisfy 


8 2 dr dr d 2 r 

dt 2 dt dt dt 2 


= 0 . 


(8.56) 


An arbitrary force f will most likely not fulfill the above constraint, and we therefore seek a constraint 
force r/j. We can add to f such that the constraint is fulfilled, that is, 


d 2 r _ f + 0 
dt 2 m 

dr dr ^ r / + </> _ Q 
dt dt m 

, dr dr 

=^r • cp = —m — •- r ■ f. 

dt dt 


(8.57a) 

(8.57b) 

(8.57c) 
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The dimensionality of 0 is typically three, hence the above equation determines one out of three un¬ 
knowns. To determine the remaining unknowns we require that the force neither add nor subtracts energy 
to the system. This is the Principle of Virtual Work. Given the kinetic energy as 


m 1 dr dr 
T = -m— ■ —, 
2 dt dt 


(8.58) 


the principle of virtual work requires that 


dT dr d 2 r 
dt m dt dt 2 

= §-<' + *> 
_ dr_ 

dt 


(8.59a) 

(8.59b) 

(8.59c) 


Hence, 


dr 

dt 


0 = 0 . 


(8.60) 


We now have two out of three unknowns for 0. The final equation is obtained by requiring that (8.60) 
must be satisfied for every ^ fulfilling (8.55), which leads us to conclude that 0 must be proportional to 
r, that is, 

0 = A r, (8.61) 


The Lagrange multiplier A is found by inserting (8. 

.61) 

into i 

(8.57b), i. 

II 

bq 

(N 1™ 

dr 

dr 

+ r ■ 

d 2 r 

dt 

dt 

dt 2 


dr 

dr 

1 fin 

/ + 0 


dt 

dt 

~r T ■ 

m 


dr 

dr 

1 'Y* 

f + Ar 


dt 

dt 

~r T ■ 

m 


= 0 , 


(8.62a) 

(8.62b) 

(8.62c) 

(8.62d) 


and A is found to be, 


A -mf-f-r-/ 

r ■ r 


(8.63) 


For a system of particles, these equations generalize as follows: we generate a state vector as the 
concatenation of particle positions 

r 

r t 
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Newton’s second law for the complete system can then be written as 

F = Mq , (8.65) 


where F is the concatenation of force according to (8.65) and the mass matrix M is diagonal on the form 
of M = diag(mi, mi,mi,m 2 ,m 2 , m 2 , m 3 ,..., m n ). Similarly to the single particle system, we form a 
global constraint vector function C(q), such that the constraints are fulfilled, when 


C(q) = C(q) = 0. 

( 8 . 66 ) 

By differentiation, we find that this is equivalent to 



(8.67a) 

= Jq. 

(8.67b) 

The matrix J is called the Jacobian of C. Differentiation once more gives us 


C(q) = Jq + Jq 

( 8 . 68 a) 

8C(q ) . dC(q ).. 

dq q dq Q ' 

( 8 . 68 b) 

Note that the calculation of J involves 3x3 dimensional matrix, 


. BJ. 

B^ q 

(8.69) 

but the evaluation of jp- can be avoided by using an explicit representation for C as shown in ( 8 . 68 b). 

Now we will find a global constraint force 4>, which causes no work. Newton’s second equation then 
becomes 

q = M~ 1 (F + §), (8.70) 

and as for (8.57b) we write 

C{q) = Jq+ JM^(F + <S>) = 0 

(8.71) 

Using the principle of virtual work, we find the solution to the vector $ by requiring that it must not 
contribute work. Writing the kinetic energy as 

T = \q T Mq , 

(8.72) 

we find the work as the time derivative of the energy. 


T = q T Mq , 

(8.73a) 

= (M~\F + <S>)) T Mq, 

(8.73b) 

= ( F t + $ T )M" lT M 9 , 

(8.73c) 

= (F T + $ T )M“ 1 Mg, 

(8.73d) 

= F T q + $ T q. 

(8.73e) 
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Algorithm ParticleSystemStep(m,p,v,N,delta) 
// Calculate new forces 
for i = 0 to N 

f(i) = calculateF orces(p,v,N) 
end 

// Calculate constraint forces 
for i = 0 to N 

c(i) = calculateConstraints(m,p,v,N) 
end 

// Calculate the velocity in phase-space 
for i = 0 to N 
dp = [v,(f+c)/m] 
end 

// Update the position in phase-space 
p = p+delta*dp 
end algorithm 


Figure 8.27: Explicit fixed time-stepping of a simple general-purpose particle system. 

For not to contribute work, we thus require that 

<I> T q = 0. (8.74) 

For this to hold for any q, and using (8.66) and (8.67b) we conclude that 

= J T A, (8.75) 

where A is a vector of Fangrangians. Inserting (8.75) into (8.71) we find 

0 = Jq + JM~ l (F + 9 ?) (8.76a) 

= Jq + JM~ l {F + J T A) (8.76b) 

and with a simple rewrite we conclude that 

J M 1 J T A = — Jq — JM _1 F. (8.77) 

Unfortunately, numerical implementations of (8.77) introduce drifts. It is therefore useful to add a damped 
spring force to the system 

JM' 1 /A = -Jq - JM X F - k s C - k d C, (8.78) 

for some suitable spring and damping constants k s and k d . 

In [Baraff et al., 2003a] Witkin presents a simple algorithm simulating particle systems with con¬ 
straints, which is sketched in Figure 8.27. The mass is given in m, the point phase space is p, the number 

of particles is N, and the magnitude of a single step in the ordinary integration is given by delta. The 

function calculateForces (p,N) calculates the forces for the specific system. For example, for 
a piece of cloth it could be the sum of spring forces for the nearest particles on the grid of the cloth, 
and calculateConstraints (m, p, N) calculates the relevant constraint forces, such as collision 
response with a sphere. 
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Figure 8.28: A piece of cloth is modeled as a grid of particles connected in triangular mesh. The coordinate 
pair denotes the particle index. 


8.7 Large Steps in Cloth Simulation 


In [Baraff et al., 1998] the authors present an implicit method for simulating cloth with a particle system. 
The basic setup is shown in Figure 8.28, where the particles arc defined on a grid and connected in a 
triangular mesh. Defining the general position for the total system r as 


r i 

r = 

_r n 

we write Newton’s second law of motion as 


(8.79) 


Mr = f(r, r) 


(8.80) 


where / is the corresponding general force vector, and AT is a diagonal matrix with [mi, m 2 ,..., m n \ 
along the diagonal. For cloth, typical internal forces considered arc bending, stretching, and shearing, 
while external forces arc typically gravity and collision constraints. 


8.7.1 Implicit Integration for Cloth Animation 

Cloth strongly opposes stretching, while it easily allows bending, and this implies that the system becomes 
stiff. Some advocate explicit methods due to the importance of high-frequency response for realistic- 
looking images [Kacic-Alesic et al., 2003]. However, explicit integration of the forces requires an enor¬ 
mous number of small steps in order to avoid instabilities. An implicit integration scheme is defined as 


r(t + h) 
r(t + h) 

= 

r(f)' 

r{t) 

+ h 

r(t + h) 
r(t + h) 



= 

r(t) 

r{t)_ 

+ h 

M~ l f (1 

+ 

*+o 

± + 

^-+o 

•5- V 


(8.81a) 

(8.81b) 
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To solve the system we assume that f is linear, that is, f may be written on the form 
f ( r (t + h), r(t + h)) = f (■ r(t ), r(t)) + hVf 
where V/ is the Jacobian matrix taken at t. We can now write (8.81b) as 


r(t) 

r(t) 


(8.82) 


r{t + h) 
r(t + h) 


r (t) 

r(t) 



r(t + h) 



+ h 

M- 1 ^ f(t ) + hVf 

r(t) 

). 


(8.83) 


where we have used fit) = f ( r(t ), r(t)) for shorthand. Rewriting this equation in terms of the sought 
updates, 

r(t + h ) — r(t) 
r(t + h) — r(t) 


A r(t) 


A r(t) 



(8.84) 


and expanding the Jacobian, we find 

= h 


A r(t) 
A r(t) 


M 


r(t ) + A r(t) 

1 (. f(t ) + h^-r{t) + 


(8.85) 


Then we replace r(t) r(t) with their forward difference approximations, 

r(t + h)—r(t) A r(t) 


r{t) 

r(t ) ~ 


h h 

r(t + h)—r(t) A r(t) 


h 


h 


to give 


A r(t) 
A r(t) 


= h 


M 


r(t ) + A r(t) 

-'im+hSt^^ + hS§l^!l) 


Finally, we insert the top row in the bottom, 

A r(t) = hM~ l (j(t) + h ( r(t ) + A r(t)) + ^®A f(t)^ 


(8.86a) 

(8.86b) 

(8.87) 

( 8 . 88 ) 


We now have an equation only depending on M, r(t ), the first-order structure of f (r(t),r(t)), and 
A r(t), and for which the latter may be isolated as 


1 -h 2 M~ l ^P- -hM 
or 


Am = hM -■ (/(t) + h B J^m) , (8.89) 


where 1 is the identity matrix. This equation may be solved for Af(t), and A r(t) may be then be 
calculated using the top row of (8.87). 
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Figure 8.29: Mapping between indices, intrinsic, and world coordinates. 


8.7.2 Cloth Forces, Damping, and Constraints 

In [Baraff et al., 1998] the constraints arc formulated as positive vector functions C, that must be mini¬ 
mized. Energy is calculated as squares of constraint functions, E = |C T C, and the forces arc found as 
derivatives of the energy functions, / = = -kjC-C. 

To define the stretching, bending, and shearing forces, let’s consider a piece of cloth that has topology 
as a plane, implying that the cloth can be spread out in a single layer on a plane without holes. Consider 
a coordinate system intrinsic to the cloth, (u(i,j),v(i,j)), where (i, j) arc the indices of a particle. This 
coordinate system is unaffected by stretching, bending, or shearing, and is used to denote the relative 
positions of the particles with respect to each other, when the cloth is most at rest. We can then calculate 
the mapping between the intrinsic and the world coordinates 




x (hj) 


= w(u(i,j),v(i,j)). 


(8.90) 


This is depicted in Figure 8.29. 

Cloth strongly resists stretching, and we may define the amount of stretching by the functions 
and ^7 for the stretch in the u and v directions on the cloth respectively. We will assume that 11 ^ 11 = 
H^ll = 1 when the cloth is at rest. Assuming for a moment that the indices (i,j) arc continuous 
variables, we write the change in world coordinate x(i,j) as a function of index i and j respectively, 

dx 

di 
Dx 

dj 


dw du dw dv 
du di dv di ’ 

(8.91a) 

dw du dw dv 
du dj dv dj' 

(8.91b) 
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Figure 8.30: Three neighboring particles in the cloth mesh 


On matrix form this gives 


dx 


~du 

dv 


'dw' 

di 


di 

di 


du 

dx 


du 

dv 


dw 

_ dj _ 


-dj 

dj- 


- dv - 


(8.92) 


Thus, given three neighboring particles as shown in Figure 8.30, we may estimate ^ and ^ by finite 
differences and matrix inversion as 


'dw 



-1 


du 


u{i + 1 , 3 ) ~ u{i,j) v{i + 1 ,j) - v{i,j) 

x(i + l,j) - x(i,j ) 

dw 


u(i,j + 1 ) - u(i,j) v(i,j + 1 ) - v(i,j)_ 


x(i,j + 1 ) - x(i,j)_ 

- dv - 






The matrix in u and v is constant, hence ^ and are a function of x, and the constraint function is 


therefore defined to be 


Cstretch(®) — Oi 


"II 9w 

II du 

II dw 

_ll dv 



(8.94) 


where a is a user-specified constant controlling the relative strength of the stretch constraint. Likewise, 
we define shearing energy by measuring the angular change performed by the mapping. 


C shear 


dw dw 

du dv ’ 


(8.95) 


where j3 is a user constant controlling the relative strength of the shear constraint. This is an approximation 
to cos 9, with 9 being the angle between the vectors, and the approximation depends on ft, since this 
controls the relative enforcement of C s h ea r- 

For the bending constraint, we define it to be proportional with the curvature of the cloth, which in 
turn is proportional with the angle between two neighboring triangle normals, that is, 


Cbend = 7 cos 1 (ni • n 2 ), 


(8.96) 


where rt\ are the normals n- 2 , and 7 is a user constant controlling the relative strength of the stretch 
constraint. 
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Strong forces require strong damping forces to stabilize the numerical integration. It has been sug¬ 
gested to use the constraint function to define damping forces [Baraff et al., 1998] as follows: 


, _ , dC- 

di — kd C. 
OV i 

Inserting this into (8.89) we need to calculate the derivatives 


ddj _ f dC dC 

drj d y dr .i drj ~*~ 

ddj _ f dC dC T \ 

drj d \ dri dr.j ) 




(8.97) 


(8.98a) 

(8.98b) 


At points of contact between the cloth and other impenetrable objects, or as specified by the user, 
constraint forces must be applied. Instead of introducing springs with high spring constants, the mass may 
be set to infinity. At first glance this seems awkward, however, since (8.89) uses the inverse mass, setting 
a mass to infinity is equivalent to setting its inverse mass to zero. To complicate matters further, a particle 
in contact with a planar surface will only be constrained in the direction normal to the surface, and we 
need only set the inverse mass to zero in the normal direction. As an example, consider a single particle, 
r-j: Newton’s second law may be written on matrix form as 


r 


i — 



r l 


mi 

0 

0 


0 

1 

mi 

0 


fi, 


mt -1 


(8.99) 


in which case we may impose a constraint along one of the coordinate axes by setting the corresponding 
element in the diagonal to zero. The extension of the inverse mass to a matrix was done by multiplication 
with the identity matrix, © 1 , and constraining along an axis could be considered as the operation ©(T — 
S), where E is a zero matrix except at the relevant diagonal element to be constrained, which is one. In 
general we have 

(1-^ T )C = 0, (8.100) 

when £ = [6 , 6 j 6 ] T is a unit vector. To verify, we simply expand the products in three dimensions, 


(i - CC T )C = £ - 

= £- 

= £- 
= o. 


©i 2 

66 

66 


66 

6 2 

66 


66 

66 

6 2 


Ci + 6£ 2 + 6 Cl 
C 1 C 2 + Cl + C 2 C 3 
C 1 C 3 + C 2 6 + Cl. 

6(Ci +C! + Cl)" 
6(Ci +C| + C 3 ) 
6(Ci +C| + C 3 ) 


( 8 . 101 a) 


(8.101b) 


( 8 . 101 c) 
( 8 . lOld) 
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Thus we may specify the constraints in 0-3 directions by letting the inverse mass matrix be 


f J_ 

for 0 constraints, 

mi 

for 1 constraint. 


for 2 constraints, 

lo 

for 3 constraints, 


( 8 . 102 ) 


where £ and ry arc mutually orthogonal vectors. 

Finally, [ Bara IT et al., 1998] has had success with solving the above system with the conjugate gradient 
method. 
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Continuum Models with Finite Differences 


In this chapter we will discuss continuum models with finite differences for the simulation of deformable 
objects. This was first presented for the animation community in [Terzopoulos et al., 1987] and later 
widely discussed [Kelager et al., 2005]. The model typically leads to rubbery objects, as illustrated in 
Figure 9.1. 

The original method was designed using finite differencing, but many have later turned finite elements 
as the computational model. The finite element method will be discussed in Chapter 10. 

9.1 A Model for Deformable Objects 

A deformable object can be identified as a point cloud. The positions of these points describe the geometry 
of the object. Any point of a deformable object can at any time be identified by a set of unique parameter 
values. We call these parameter values the material coordinates and denote them with the symbol a. For 
a solid, a = [ai, < 22 , 03 ©, for a surface, a = [ai, a 2 ] T , and for a curve, a = [ai]. 

As the object deforms, points belonging to the object move. The position of an object point is a time- 
varying vector depending on the material coordinates. A point belonging to an object in Euclidean space 
is denoted r and is given by 


r(a,t) = [ri(a,t),r 2 {a,t),r 3 (a,t)]. (9.1) 

In Figure 9.2, we show the relationship between a surface described by material coordinates and the 
points in Euclidean space. 

The deformation of an object is modeled using elastic forces. When an object is in its rest shape, 
meaning that it is undeformed, there is no elastic energy stored in the object, but when the object is 



Figure 9.1: Examples of models using the continuum formulation, kindly provided by Micky Kelager 
and Anders Fleron, at the time of writing students at the Department of Computer Science, University of 
Copenhagen. 
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Figure 9.2: Parameterization of a deformable object. 


deformed, an elastic energy is built up in the object. The deformed object wants to return to its rest shape, 
which is equivalent to minimizing the elastic energy stored in the object. Thus, simulating deformable 
objects is all about energy minimization. 

The deformation of an object in a given point is denoted with a potential elastic energy function 
£ = £ (r(a, t)). This energy should be independent of rigid-body movement. 

We work in a continuum model, which means that we consider the object as a whole, and not just a 
representation of the object, for instance, its center of mass. This means that a differential volume of the 
object is assigned a density of mass, a density of energy, a density of damping, and so on. Thus, in the 
limiting case, we have a mass density function, an energy density function, etc., describing the distribution 
of densities at any point in the object. 

The Lagrangian density, see Section 22.5, is an energy density function, and it can be written as 

C = \p{a) — £(r(a,t)). (9.2) 

Here p{a) is the mass density in a given point, and the derivative of r with respect to time, describes the 
velocity of motion. The object deforms due to some applied external forces, f (a K and the deformation is 
damped by a damping force, f id \ given by 

f (a) = f(r(a,t),t), 

/<“> = _ T 


(9.3a) 

(9.3b) 


where 7 (a) is the damping density. The damping force resists deformation, while the applied external 
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force can be a sum of several different kinds of forces. Examples of these external forces arc given later 
in Section 9.4. 


9.1.1 The Lagrangian Formulation 


The equations of motion for a continuous object can be described by the Lagrangian formulation. The 
theory and derivations can be found in Section 22.5. The specific equation we use for our model. Equa¬ 
tion (22.186), is repeated here: 


d f d£ \ 



(9.4) 


Now let us try to substitute the expression for the Lagrangian density of our deformable model, (9.2), 
into the above Lagrangian formulation: 


d_ 

dt 


( d | 

flM(a) 1 

( dr(a,t ) N 
{ dt ) 

1 -£(r(a,t))y | 

1 51 

(Vw ( 

V 


adr(a,t) 

° dt J 



Sr (a, t) 


(9.5) 


If we carry out the partial derivatives on the left side of the equation, we obtain 


d_ 

dt 



(^)) 


5£{r{a,t )) 
Sr (a, t) 


Q. 


(9.6) 


The quantity Q is the total sum of applied forces. In our model, these arc the external forces, / (a) , and 
the damping force, / (,/) . Substituting the expressions from (9.3) for these forces yields 


d_ 

dt 



(^)) 


S£(r(a, t)) 
Sr(a , t) 


f(r,t(a,t)) — 7 ( a ) ~~~rr~^ 


(9.7) 


If we move the damping force to the left side, and exchange absolute derivatives with partial derivatives 
wherever possible, we get our Lagrangian model for a deformable object under motion: 


d_ 

dt 


/x(a) 


dr (a, t ) 
dt 




dr(a,t ) S£(r(a,t)) 


dt 


+ 


Sr(a , t) 


= f(r(a,t),t), 


(9.8) 


where r(a,t) is the position of material point a at time t, ji(a) is the mass density, 7 (a) is the damping 
density, f ( r(a , t), t) is the applied external forces, and £ ( r(a , t) ) is potential energy due to the elastic 
deformation of the body. The derived equation is an exact match of the deformable model introduced in 
[Terzopoulos et al., 1987], 

To ease notation, we omit the explicit dependencies on position a, and time t. The equation of motion 
is then reduced to 


d 2 r dr 


5£{r) 
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We find a connection with classical mechanics when we assign labels to the different parts of the equation. 
For example, // has dimension of mass, and is acceleration; thus p'jjjf is mass times acceleration, 
which we denote with m • a. Similarly, we find that 7 has dimension of viscosity, 'jjj- has dimension of 
velocity, and combined they have dimension of force, e.g., T%; = -^viscous- Continuing this, we can put a 
label on each of the terms in the equation of motion, which yields: 


8 2 r dr 5£ ( r ) 

FliTT + 7 — + -77— = f © 


^elastic 


-^external 


(9.10) 


Doing so, we can gather the viscous, elastic, and external force, as F tota i = F externa i — (-^viscous + -^elastic)- 
With this notation, we find that our equation of motion is equivalent to Newton’s second law: 


m - a, — Fjotai. 


(9.11) 


Normally, we think of Newton’s second law used directly on one body. In the Lagrangian formulation, 
the body is parameterized. It is therefore a continuous description. 


9.1.2 Designing the Energy Functional 

The £(r(a , t)) functional describes the potential elastic energy of our deformable object. This functional 
is only dependent on the form of the object. We do not want our object to change form, depending on its 
position in the world, so we need a description of form that is invariant to rigid body motion. 

Differential geometry is well suited to describe the local shape of any kind of object. Therefore, we 
can use this description to set up an energy functional, which penalizes a deforming shape with respect to 
its natural or initial shape. When the simulation is run a force will be applied, pulling the body toward its 
natural shape, until equilibrium is reached. Thus, penalizing is equivalent to energy minimization and the 
equilibrium shape is equal to energy minimum. 

The first fundamental form or the metric tensor is a measuring tool for points in an object. It describes 
how to measure distances between points in the object, independent of the world coordinate frame. For a 
point r in our parameterized object, the metric tensor 


©1 G 12 
G21 G22 


is a symmetric matrix with elements defined by 


Gij (■r(a)) 


dr dr 
dai daj 


(9.12) 


(9.13) 


The differential distance between two nearby points with material coordinates a and a + da is then found 
by 

dl = ^2 Gij (r*(a)) dciidaj. (9.14) 
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Figure 9.3: Distance between points on surface is independent of the mean curvature. 


For 3D solid objects, the metric tensor is a 3 x 3 matrix. If two solids have the same shape, their metric 
tensors arc identical functions of a. However, for 2D surfaces, this description is no longer sufficient. The 
distance between points on a surface is independent of the mean curvature of the surfaces as seen in 
Figure 9.3, so we need to look at the second fundamental form or the curvature tensor. For a point in our 
parameterized surface, the elements of the curvature tensor. If arc defined as 


Bij ( r(a )) = n • 


d 2 r 

dai ■ daj ’ 


(9.15) 


where n = [n \ , n-i , is the unit surface normal. If two surfaces have the same shape, their 2x2 metric 
tensors, G, as well as their 2x2 curvature tensors, If arc identical functions of a. 

Finally, if we restrict our deformable objects to ID curves, we need yet another measurement to 
account for twisting, in order to decide if two curves have the same shape, as illustrated in Figure 9.4. The 
metric and curvature tensors arc now reduced to scalars, called arc length s and curvature n. Since the 
curve can twist, we need to compare their torsion r, which is defined as 


r = n ■ (t x n )', (9.16) 

where t is the tangent, (t x n) is the binormal, and (t x n)' is differentiation of the binormal with respect 
to the arc length parameterization. In the following, we will restrict the description to 2D surfaces. It is 
easy to incorporate the torsion for curves, or remove the curvature for solids. 

In the model for a deformable object, (9.8), the external forces working on the object arc countered by 
internal forces. The internal elastic force 


Elastic = (9-17) 

or 

deals with the elastic energy that is produced when the deformable object has a shape that is different from 
its rest shape. We want to model the energy in a way that resembles real-life elastic objects. For example, 
when compressing a rubber ball, it returns to its natural ball shape when the pressure is released. We can 
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Figure 9.4: The shapes of two curves depend on torsion. Image shows two curves, with same arch length 
and curvature metrics, but with different torsion. 


model this behavior for our deformable object, by requiring that the material points at the deformation 
should have an increased elastic energy. This ensures that the elastic force makes the material points 
want to seek their natural rest shape, e.g., the undeformed object. We further want the model to have 
a low elastic energy when deformation is small, and a high elastic energy when deformation is large. A 
simple, but reasonable, model of the elastic energy £(r(a, t)) in an object being deformed is the difference 
between the fundamental forms of the object in its deformed state and in its rest shape. We denote the 
fundamental forms of the object in its natural rest shape with a superscript 0; that is, G° is the metric 
tensor, and B° is the curvature tensor for the object in its natural rest shape. This elastic energy can be 
written as 

£ (r) = f \\G-G°\\^ + \\B-B°\\* da x da 2 , (9.18) 

j 

where 12 is the domain of the deformable object and ||-|| and j ■ |y arc weighted matrix norms. This 
model for the elastic energy has the nice properties that it is zero for rigid body motion and that it includes 
the fewest partial derivatives necessary to restore a deformable object to its natural shape. 

9.2 Model Relaxation 

In the previous sections we derived a mathematical model for a deformable object. The mathematical 
model is now converted to a discrete model with nice numerical properties and efficiently solvable on 
a computer. The numerical property we aim for is that small errors, due to imprecision and round-off, 
arc not amplified, but rather dissipated. Choosing this strategy has some tradeoffs. The motion is often 
damped by the dissipation. The more dissipation, the more damped the motion appeal's to be. In contrast. 
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the more damping, the more abrupt, violent, and destructive change can be applied to the deformable 
object, without the animation blowing up. From a computer graphics viewpoint, efficiency often implies 
taking very large, fixed time-steps, with comparatively low time-complexity. 

We start by simplifying the expression for the elastic energy. Thereafter, we work through the mathe¬ 
matics of calculating the variational derivatives of the elastic force, which leads us to another simplifica¬ 
tion. When this is done, we show how to discretize the simplified equation of motion in space. Finally, 
the resulting system, which has been converted from a partial differential equation to a system of coupled 
ordinary differential equations, is discretized through time and ready to be implemented. 


9.2.1 Simplification of the Elastic Force 

In the previous section, we derived an expression for the elastic energy, when the object was deformed 
away from its natural shape. Using the weighted Frobenius norm , also called the Hilbert-Schmidt norm 
[Weisstein, 2004], which is defined as 



(9.19) 


we can rewrite (9.18) as follows: 

£(r) =J ||G-G°||J + \\B-B°\\* da x da 2 , (9.20a) 

r 2 

= / J2 (%■ ( G H - G l f + & ( B V - K) 2 ) da i da 2- (9.20b) 

1 , 3=1 

The weighting matrices, 7 ] and £ will control the strength of restoration of the first and second fundamental 
form, respectively. 


9.2.2 The Variational Derivative 

With the above simplification of the elastic energy functional, we are now ready to find an expression for 
the elastic force, F e i as tic = ©p-- The variational derivative 

of a potential energy functional can be interpreted as a force as explained in Section 22.3. Calcu¬ 
lating the variational derivative also corresponds to finding the function that minimizes the functional as 
described in Chapter 21. That means that by calculating the variational derivative, we find an expression 
of the elastic force that seeks to minimize the elastic energy. 

To better follow the derivations, we split (9.20b) into two parts. The first, S, dealing with the first 
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fundamental form, and the other, T, dealing with the second fundamental form: 

2 

S= (9.21a) 

i,j =1 

2 

T = E & ( B H - B ij) 2 ■ <9-219) 

i,j= 1 

These terms will be developed individually, using the calculus of Variation (see Chapter 21). 

9.2.2.1 Variational Derivative of S 

We will now derive the variational derivative of S. The function S on expanded form is 

5 = 7711 (Gn - G?,) 2 + //, 2 (Gi2 - G° 12 ) 2 + rj2i {G 21 - G° 21 ) 2 + i l22 (G 22 - G° 22 ) 2 . (9.22) 

Since G is symmetric, we choose r/i 2 = r/ 2 1. Using the fact that G tJ = the expression becomes 

(dr 3r \\ (8r dr 0 V / c>r 0 

s - mi \d^'d^~ Gl1 ) + 2??12 V^T^ 12 J +r?22 V^T^ 22 

dr (a) 

Introducing the shorthand r ai = j)n , the equation becomes 


-S' = r/n (r ai - r ai - G^) 2 + 2 t/i 2 (r fl 
which may also be written as 


— Gu) + V22 (l*a 2 ' r a 2 ~ G 22 ) 


« \ 2 


(9.23) 


(9.24) 


S(r)= / S (ai,a 2 ,r,r ai ,r a2 )daida 2 . 
Jn 


The first variational derivative of S is defined by 

SS 


dr 


= Sr- d ai S rr ,. - daoS: 


0,2 u r a 


Observe that nowhere does S depend explicitly on r, so S r = 0. That means we end up with 

+ 47712 (r ai ■ r a 2 - G? 2 ) r a 


6S - 0 

v — 

or 


-d, 


CL2 


' 



4r/i 1 (r ai 

•rai 

-G° u ) 


OL 11 


4t?i 2 (r ai 

‘^a 2 

-g° 12 ) 


OL12 



&12 


-g° 22 ) 


CL2 


(9.25) 


(9.26) 


(9.27) 
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Cleaning up we have 


SS 

dr 


-d ai [anr ai + ai 2 r ffl2 ] 


Using the product rule we can now write 


da 2 [«12^ai + Ot22r a2 ] ■ 


SS 

5r 


{9aiOiii)v ai CTll^aiai (^ai“l2) ^ a 2 dl2^aia 2 
(da 2 «12) T ai ~ ai2^a 2 ai — (5a 2 «22) ? a 2 “ «22^a 2 a 2 , 


which can be reduced to 


(9.28) 


(9.29) 


= (9.30a) 

*d=i 

“O' = Vij (Gij - G°) , (9.30b) 

fjij=4r]ij. (9.30c) 

This is the expression of the variation of the first fundamental form. The constant r)ij is user specified, 
hence we might as well consider fj^ as the user-specified parameter. This will be our practice in the 
remainder of this chapter. 


9.2.2.2 Variational Derivative of T 

Following the same procedure for T, using the fact that B;j = n ■ r aj<H , and choosing £12 = £ 21 * yields 

T = 61 {n ■ r aiai - B^) 2 + 2 £, 2 (n • r aia2 - B° l2 ) 2 + £22 [n ■ r a2a2 - B% 2 ) 2 . (9.31) 

We can then write the T -paid of the energy functional as 

T (r) = / T (ai,a, 2 ,r, r ai ,r a2 ,r aiai ,r aia2 ,r a2a2 ) da\da 2 - (9.32) 

Jn 

The first variational derivative of T is defined by 

b'T 

= T r — d ai T rai — d a2 T ra2 + d aiai T Vaiai + ‘2da 1 a 2 Tr ai a 2 + da 2 a 2 Tr a2 a 2 - (9.33) 

B—, , ,—© 
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Again, we see that T r = 0, so 


5T 

Sr 


T0 aiai &r 

4“29 ai 


9 r ai 

61 

( n 

T a\a\ 

pO N 

" “n, 

) 2 + 

262 

(n • 

l’aia 2 

- ^ 2 ) 2 

+ 

62 

(n • 

'f'(l 2 CL 2 

pO 

“ ^22 

9 r a2 

6i 

( n 

aiai 

p0 N 

■ “n, 

) 2 + 

262 

(n • 

l’aia 2 

- ^ 2 ) 2 

+ 

62 

(n • 

'f'(l 2 CL 2 

pO 

“ ^22 


6i 

( n ' 

aiai 

p0 N 

■ “n, 

> 2 + 

262 

(n • 

r*aia 2 

- ^ 2 ) 2 

+ 

62 

(n • 

CL 2 CL 2 

pO 

" ^22 

r 'a 1 a 2 

61 

( n ' 

T a\a\ 

p0 N 
■ -“11, 

) 2 + 

262 

(n • 

' r a\a 2 

- ^ 2 ) 2 

+ 

62 

(n • 

'f* CL 2 CL 2 

pO 

^22 

r a 2 a 2 

6i 

( n 

V* a\a\ 

pO N 
- “11, 

) 2 + 

262 

(n • 

r*aia 2 

-^ 2 ) 2 

+ 

62 

(n • 

1 *C 12 C 12 

pO 

" ^22 


(9.34) 


Taking care of the first partial differentiation using the differentiation rule in (18.173), gives 


ST 

5r 


- <9 ai 

261 

(n ' l^aiai 

-B° u ) 

n l ai r a iai + 462 (n • r a 102 - S? 2 ) nl ai r aia2 


+262 (n • i“a 2 a 2 - £22) n^ ai r a2Q2 


-da 2 

261 

(n • raiai 

-B° u ) 

n la 2 r a iai + 462 (n • raia 2 - 5? 2 ) 


+262 (n • l“a 2 a 2 - £22) n r„/a 2 a 2 


Tt/aiai | 

261 

(«■ • r ai ai 

-B° u ) 

n] 


+2<9aia 2 

462 

(n • r a i a2 

-b° V2 ) 

n] 


Tt/a 2 a 2 | 

262 

(n • r a2CL2 

- b° 22 ) 

n] . 



(9.35) 


As for the .S'-case, we introduce a symbol representing the paid of the equation that concerns the funda¬ 
mental form. Specifically, we set 

fill =2£n (n • r aiai - B^) = | n (n • r aiai - B^) , (9.36a) 

P12 =4^2 (n • r aia2 - B° X2 ) = 6 2 (n • r aia2 - Bl 2 ) , (9.36b) 

/?22 =262 (n • r a2a2 - ^22) = 62 (n • r a2a2 - 5 22 ) . (9.36c) 

The parameters 6 j are user specified. For rjij, for convenience we introduce 6 j as the new user-specified 

parameters, and finally we may write the variational derivative of T as 


ST 

Sr 


~d ai [ 
~ d a2 
T d aiai 


Pn n r ai r aiai + Pnnl ai r aia2 + 0 22 n 

Punier aiai + aia2 + (3 22 n 

Pn n + 2d ai a 2 f3i2n + 9 a2a2 /3 22 n. 


CL2CL2 

f*CL2CL2 


(9.37) 
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This equation is the variational derivative of the part of the energy-functional that concerns the second 
fundamental form. As can be seen, it is rather complex, so we would like to approximate it with something 
simpler, which can be done in many ways. 

In [Terzopoulos et al., 1987], an approximation of the T -part is chosen by an analogy to the S'-part 
without any theoretical justification. Our derivation shows how this approximation can be justified. First 
we ignore all first-order terms, which seems reasonable since curvature describes a second-order property. 
Then we replace the normal n with r aiaj , which can be justified as follows: the first derivative of a point 
on a curve is the tangent, the second derivative corresponds to a normal. We then arrive at the following 
approximation: 

~ d aiai Pll^aiai 2<9 ai CL2 Pl2 TCLICL2 “F ^0,20,2 /^11 0,20,2 * (9.38) 

As can be seen, the above approximation is very si mi lar to (9.30). The only difference is the different 
multiplicative constant hidden in the (3\2 term. Rescaling this term can be done without changing the 
fundamental properties of this term. To see this, we can study the analogy with gradient descent. Here, 
the negative gradient always points in the minimum direction. Thus, the direction remains the same, 
regardless of any chosen multiplicative constant. This implies that during minimization, we always walk 
in the direction of the minimum. The multiplicative term only determines how long the steps arc. With 
this justification, we arrive at the final approximation for the T-term: 

ST ~ 

— « daidj ( Pijraictj ) , (9.39a) 

i,j =t 

Pij =iij (n • r aiaj - B®j) . (9.39b) 


9.2.2.3 The Complete Variational Derivative 

This concludes the derivation of the approximating vector expression for the variational derivative of the 
energy potential, yielding the following relation: 

58 2 

e = ~ e (r) = ^ -d (H (a^.) + d aiaj (/%r aia .) , (9.40a) 

i,j= 1 

OLij =Vij ( Gij - Gij) , (9.40b) 

Pij =iij ( Bij - S° ) . (9.40c) 

The size of the elastic force vector, e, describes the elasticity inherent at a point in a deformable object. 
This is an internal property of the object, and it increases as the object deforms at the intrinsic position. 
So, the elasticity, e, is the elastic force that should work on a point to minimize the deformation. 

9.2.3 Understanding the Model 

The two constitutive equations (9.40b) and (9.40c) model the governing behavior of the elastic motion. 
Comparing them with Hooke’s Spring Law, described in Section 22.4, which states that 

F = — kx, (9.41) 
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where a; is a displacement vector, and k is the spring constant, we can see a resemblance. The rj and £ 
tensors both work as spring constants, albeit without the — sign, and the difference of the fundamental 
form of the deformed object, from the fundamental form of the undeformed object, can be seen as dis¬ 
placement. This means, that the behavior of the elasticity is actually a spring model, with the fundamental 
forms as the units of measurements. 

The first paid of (9.40a), can be seen as the tension constraint on the elasticity. Inspecting the metric 
tensor. 


G 


Gn 

G21 

G12 
G22 



(9.42a) 

dr 

dr 

dr 

dr 


Pa 1 
dr 

da 1 
dr 

da 1 
dr 

da 2 

dr 

, (9.42b) 

dci 2 

da 1 

da 2 

da 2 



we can see the significance of the different entries. The two elements, Gn and G 22 , are the squared 
vector norm of the derivative in each of the coordinate directions. These entries measure the length in 
each direction, and compared with the rest state, G°, determine stretching or compression of the material. 
The entries G 12 = G 21 measure the angle between the two coordinate directions, since the dot product is 
defined as 


cos 9 

x ■ y = -—-—; 

\ x \\y\ 


(9.43) 


Together with G°, they represent an angle constraint. 

The constants fjij contains the weights of the metric tensor, and can be written as 


V 11 V 12 

V 21 f/22 


(9.44) 


with fjij > 0. The fjn and 7)22 terms describe the resistance to length deformations in the two directions, 
while 7712 = 7721 describe resistance to shearing. To simulate a material that behaves like soft rubber, 
one can use a small fj, while a larger rj will model a less stretchable material, such as paper. In prac¬ 
tice, it is seen that, if some fjij > 1 and for large time steps, then the system tends to become unstable 
[Kelager et al., 2005]. 

The second paid of (9.40a), can be seen as the curvature constraint on the elasticity. Inspecting the 
curvature tensor, 


Bn 

B 12 


B21 

B22 


n 

d 2 r 

„ d 2 r 

da 2 

da\-da 2 

n ■ 

d 2 r 

d 2 r 

n ■ d-k 

da% 

da 2 -da\ 


(9.45a) 

(9.45b) 


we find that B n and B 22 measure bending in each of the two coordinate directions, with B 12 = £>21 
measures twisting of the coordinate frame projected onto the normal. Comparing the values with the rest 
state, B°, we have a measure of how the surface deforms with regal'd to bending and rotation. 
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The constants Gj contains the weights of the curvature tensor. These can be collected in a rigidity 
tensor 


1 = 


In 

.£21 


ft2 

62,. 


(9.46) 


where £n and £22 describe the resistance to bending, and £12 = £21 describes resistance to twisting. To 
model surfaces with flexible bending properties, like cloth, £ should be close to zero. A thin metal plate, 
which has a high resistance to bending, should have a larger £. In practice, it is often seen that the system 
stays stable, by keeping the values £jj < 1 [Christensen et al., 2004]. 


9.3 Discretizing the Model Relaxation 

In the previous section, we derived a simplified approximation of the original parti al equation given in 
(9.18). This approximation resulted in a vector expression, repeated here for convenience: 


2 

e = 'y ' —&ai ( OtijTaj ) + d aia . ( f3ijV ai aj) , (9.47a) 

i,j=1 

otij =fjij ( Gij - G° ) , (9.47b) 

pij =iij ( Bij - B%) . (9.47c) 


This expression for the elastic force is continuous in the material coordinates a of the deformable object. 
To be able to implement this, we need to discretize it, thereby transforming the original partial differential 
equation (9.8) into a system of ordinary differential equations. This system can then be converted to a 
matrix-vector form, which is easily implemented and solved on a computer. In this section, we will use 
the standard technique of finite difference approximations as described in Section 20.2, to calculate the 
different derivatives involved in the expressions. The deformable surface is described by a continuous 
mapping from material coordinates a = [ 01 , 02 ]^ to world coordinates r (a) = \r x , r y , r z 1 . The domain 
of the material coordinates is a unit square, 0 < ai,a 2 < 1. We discretize the material domain into a 
regular M x iV-sized grid of nodes, with an internode spacing of h\ and /p, given by 


hi = 


1 

m - r 


h r 2 


1 

N-l' 


(9.48a) 

(9.48b) 


This is illustrated in Figure 9.5. We will use square brackets to indicate that we approximate a continuous 
function r ( 01 , 02 ), on the discrete grid, where 0 < m < M — 1 and 0 < n < N — 1. For example, 
r [m, n] = r (mh\,nh 2 ). 

By analogy to particle systems presented in Chapter 8, a grid node can be thought of as a particle. 
The metric tensor G can then be thought of as structural springs, and the curvature tensor B as bending 
springs. 
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Material Coordinates World Coordinates 



Figure 9.5: Discretization of the material coordinate domain. 

9.3.1 Finite Difference Operators 

To ease notation in the following, we will introduce symbolic names for the finite difference operators 
needed. The symbol D indicates a finite difference operator. The subscripts denote the material coordinate 
axis along which the difference is performed. The superscript + indicates a forward difference, while a — 


indicate a backward difference. 

The forward first difference operators arc defined by: 

Df(u) [m,n] = (u [m + l,n] — u [m,n]) /hi, (9.49a) 

D 2 (u) [m,n] = (u [m,n + 1] — u [■ m,n ]) /h- 2 - (9.49b) 

Similarly, the backward first difference operators arc defined by 

Df(u) [m,n] = (u [m,n} — u[m— l,n]) /h\, (9.50a) 

Df{u) [m,n ] = ( u [m,n\ — u [ m.,n — 1]) /h- 2 - (9.50b) 

The forward and backw ard cross difference operators, can then be defined as 

Df 2 (u) [m,n} = D 2l (u) [m,n] = Df (ZT^(-ii)) [ m,n} , (9.51a) 

Df 2 (u) [m,n\ = Df^u) [m,n] = Df (i9^(w)) [m,n\ . (9.51b) 

The central second difference operators can then be defined as 

D\\(u) [m,n} = Df [ m,n ] , (9.52a) 

D 2 2 (u) [m,n\ = Df (D^(u)) [m,n\ . (9.52b) 


Finally, we define combined second difference operators, I)\+ ] and d\- \ that means: use the forw ard 
or backward cross differential operator, when i j, and the second central difference operator Da, 
otherwise. 
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9.3.2 Discretizing the Elastic Force 

With the above operators, it is very easy to write up the discrete versions of the elastic force F e | ast j c . First, 


the two continuous versions of the constitutive functions, a and /?, given by 

/ F)t* \ 

r) = rjij(a) (G tj - G°) = %'(“) ( ^ - Gy) , (9.53a) 

( d 2 r \ 

i%j(a , r) = Zijia) (B t3 - = &j(a) [ n ■ ^ ^ - B^J , (9.53b) 

arc discretized to yield 

atij [m, n\ = rjij [m, n] (D+ (r) [m, n] ■ Df ( r ) [m, n} — G^j [m, n]) , (9.54a) 

/3ij [m, n\ = iij [m, n\ (n [m, n\ ■ ( r ) [m, n] - [m, n]^ , (9.54b) 


where the initial tensors, G° and B°, can be calculated analytically, or by using the same finite difference 
operators on a discretization of the rest shape. The finite difference operators appeal's to give higher 
stability. Hence, the continuous elastic force, given by 

_ ^ d f dr\ d 2 f d 2 r \ 
dai \ ' 12 da,j ) dciidcLj \ 13 daidaj ) ’ 

can be discretized to yield 

2 

e [to, n\ = y~] -D~ (p) [to, n\ + ( q ) [to, n \, 

i,j =t 

p = aij [to, n\ D t (r) [m, n ], 
q = [to, n\ D^ ] (r) [to, n\ . 

Notice that the final discretization uses a combination of forward and backward difference operators. This 
combination results in a central difference approximation for r. 

The elastic force has been separated in two parts, p [m, n and q [to, n], to emphasize the fact that 
they are fields of the same size as the discretized surface r [ to , n\. It therefore makes sense to apply the 
finite difference operators on these fields as well. That means, to calculate the elastic force e for a given 
material point r [to, n], we only need to look in the neighborhood of the material point. This is the key in 
understanding how to perform the actual implementation, which is described next. 

9.3.3 The Stiffness Matrix 

With the discretization in place, we now want to derive a matrix-vector equation for the elastic force. In 
the following description, we look at a deformable surface consisting of M x N material points. Let 


(9.55) 

(9.56a) 

(9.56b) 

(9.56c) 
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us collect all the particle positions, r [m,n\, in one large MN dimensional vector, r. We then want to 
reformulate the discretization, in such a way, that we can calculate the elastic force of all the particles 
simultaneously, by the expression 

e = K(r)r, (9.57) 

where e is an MN dimensional vector containing the elastic force for all material points. The matrix 
K(r), with dimensions MN x MN, is called the stiffness matrix. 

To assemble the stiffness matrix, we need to have a closer look at the discretizations of the elastic force. 
We will evaluate the finite differences to see which material point positions arc involved in calculating the 
elastic force for a single material point. We will discover that the material points involved all lie in a close 
neighborhood, which gives an analogy to discrete filtering kernels, also called stencils. 


9.3.3.1 The a Stencil 


Expanding the first paid of (9.56a), we get 

2 

—D~(p) [to, n] = — D (an [to, n] Df (r) [m,n]) (9.58) 

i,j= 1 

- D (a i2 [to, n\ ( r ) [to, n]) 

- D 2 (a 2 1 [to, n\ Df ( r ) [to, n]) 

- D 2 (a 22 [to, n] ( r ) [to, n]) . 

Taking only the first paid of this, and evaluating, first the backward difference operator, then the forward 
difference operator, we find 

—(an [m,n] Df (r) [to, n]) 

—an [m, n] D+ ( r) [to, n] + an [to — 1, n] (r) [to — 1, n] 


hi 

—an [m, n} (r [to. + 1, n] — r [to., n]) + an [to — 1 , n] (r [to, n] — r [to — 1, n]) 

: K 


(9.59a) 


(9.59b) 


an [m,n] a u [m, n] + an [m - 1, n] an[m-l,n] 

-r [to + l,n| H- —2 -r [to, nj--r [to — l,nj . 






hi 


(9.59c) 


Doing the same for the other parts of the summation, and gathering terms, we can write the results as a 
3x3 stencil, 


0 

cx2i[Tn,n—l\ a22[iri,n—l\ 

hih.2 h'i 

ot .21 [m,n— lj 
hih.2 

an[m—l,n\ ai 2 [?n—l,n] 

an [m,n\-\-aii [m— l,nj 

h i 

i Qii 2 [m,n]-\-a 2 i [m,n\ 

an[m,n] a 2 i [m,n] 

hih.2 

h\h,2 

1 a.22[m.n\+a22[m,n— 1] 

+ h% 

h'f hih.2 

ai 2 [m—l,n\ 
hih,2 

Q;i 2 [m,nj oi 22 [m,n\ 

hih .2 h'i 

0 


(9.60) 
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Algorithm compute-alpha() 
for m=0 to M-l 
for n=0 to N-l 

alpha_ll[m][n] = alpha_12[m][n] = alpha_22[m][n] = 0 
vector3 Dll_r = (r_cur[m+l] [n] - r_cur[m-l] [n])/ 2 h_l 
vector3 D22_r = (r_cur[m] [n+1] - r_cur[m] [n-l])/ 2 h_2 
alpha_ll[m][n] = eta_ll[m][n]*( Dl_r * Dl_r - G0_11[m][n]) 
alpha_12[m] [n] = eta_12[m] [n]* ( Dl_r * D2_r - G0_12[m] [n]) 
alpha_22[m][n] = eta_22[m][n]*( D2_r * D2_r - G0_22[m][n]) 
next m 
next n 

End algorithm 


Figure 9.6: Pseudocode showing how to compute the alpha grid. Von Neumann boundary conditions as 
described in Section 9.3.7 should be applied to the grid of current particle positions. 


corresponding to the grid positions, 


[m — 1, n — 1] 

[m, n — 1] 

[m + 1, n — 1] 

[m — 1, n] 

[m, n] 

[m + 1, n\ 

[m — 1, n + 1] 

[m, n + 1] 

[m + 1, n + 1] 


Now the o-stcncil, S a , can be seen as a filter that can be applied on the positional grid, r. When applying 
this filter at position r [m, n\, we effectively calculate (9.58). 

Figure 9.6 illustrates in pseudocode how to compute a. 


9.3.3.2 The p Stencil 

Expanding the second paid of (9.56a), we get 

2 

D H (g) [m, n] =D U (3 U [m, n\ D u ( r ) [m, n\ (9.62) 

i,j =1 

+ D^ 2 p 12 [m, n\ Df 2 (r) [m, n] 

+ D 2 i /?21 [m, n\ D 2l (r ) [m, n] 

+ D 22 P 22 [m, n\ D 22 {r) [m, n] . 


As for the o-stcncil. we can expand this expression fully by substituting the expressions for the finite 
difference operators, and collect terms. This time, due to the use of the central and cross difference 
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operators, the result is a 5 x 5 stencil. 


0 

0 

Sp [to, ti — 2] 

0 

0 

0 

Sp [to — 1,71 — 1 ] 

Sp [to, 71 — 1 ] 

5/3 [to + 1,71 - 1 ] 

0 

1 

to 

Sp [to — 1 , 7 l] 

03 [to, n] 

Sp [to + 1 , n] 

Sp [to + 2, ti] 

0 

Sp [to - 1 , 71 + 1 ] 

Sp [to, 71 + 1 ] 

5/3 [TO + 1 , 71 + 1 ] 

0 

0 

0 

5/3 [to, ti + 2] 

0 

0 


where the nonzero entries in each row arc given by: 


Sp[m,n - 2] = (0 22 [m,n- 1]) //if, (9.64a) 

Sp[m — l,n — 1] = (P 12 [to — 1,ti — 1] +0 2 i [to — l,n — 1]) /(/if/i|), (9.64b) 

03[to,ti - 1] = (—/?12 [to, tt. - 1] - 0 i2 [to - 1,ti - 1] - /? 2 i [to, n - 1] - 0 2 1 [to — l,n — 1])/(/if/if) 

+ (—2/022 [to, n] - 2/022 [to, n - 1]) //zf, (9.64c) 

03 [to + 1,71 - 1] = (/?12 [to, n - 1] + 02i [to, n - 1]) /{h\h\), (9.64d) 

03 [to - 2, n] = (/?n [to - 1, n]) //zf, (9.64e) 

03[to — l,n] = (—2/3ii [to, n] — 20n [to — l,n]) //if 


+ (—/3i2 [to - 1, n] - / 0 i 2 [to - l,n - 1] - /0 2 i [to - l,n] - /0 2 i [to - l,n - 1]) /(h\h\), 

(9.64f) 

03 [to, n] = (/0ii [m + 1,n] + 40n [to, n] + 0n [to - 1, n]) //if 

+ (012 [to, n] + 0 i 2 [to, n - 1] + 02 i [to - 1, n] + 02 i [to - 1, n - 1]) /(/zf/z|) 

+ (021 [to, n] + 0 i 2 [to, n - 1] + 02 i [to - 1, n] + 02 i [to - 1, n - 1]) /(/if/if) 


+ (022 [to, n + 1 ] + 4022 [to, n] + 0 22 [to, n - 1 ]) /hj, (9.64g) 

Sp [to + l,n] = (-20H [to H- 1, 77 .] — 20H [to, ra]) //if 

+ (-012 [to, n] - 0i2 [to, n - 1 ] - 02i [to, n] - 0 2 i [m, n - 1]) /(/if/i|), (9.64h) 

S/3[to + 2, n] = ( 0 n [to + 1, n]) //if, (9.64i) 

^[to- 1,71+ 1] = (012 [to - 1,n] + 021 [to - l,7i])/(/if/i|), (9.64j) 

Sp[m,n+ 1] = (-012 [ill, 71 ] — 012 [to — 1, 7 l] — 021 [TO, 71 ] - 021 [TO - l, 7 l]) /(/lf/lf) 

+ (-2022 [to, 71 + 1 ] - 2022 [to, 7 l]) //if, (9.64k) 

09 [to + 1 , 71 + 1 ] = (012 [to, 71 ] +021 [to, 7 l]) /(/lf/ll), (9.641) 

03[to, 7i + 2 ] = (022 [m,n + 1 ]) //if. (9.64m) 


Again, when applying the Sp stencil to a node, given by r [to, n] in the positional grid, the result is an 
evaluation of (9.62). 

Figure 9.7 illustrates how to compute 0 in pseudocode. 




© 


“book” — 2005/9/30 — 15:44 — page 323 — #335 


£ 


© 


© 


9.3 PHYSICS-BASED ANIMATION 


323 


Algorithm compute-beta() 
for m=0 to M 
for n=0 to N 

beta_ll[m][n] = beta_12[m][n] = beta_22[m][n] = 0 
vector3 Dll_r = (r_cur[m+1][n] - r_cur[ra-1][n])/2*h_l 
vector3 D22_r = (r_cur[m][n+1] - r_cur[m][n-l])/2*h_l 
vector3 D12_r = (r_cur[m+1][n+1] - r_cur[m+l][n] 

- r_cur[m][n+1] + r_cur[m][n])/h_l*h_2 
beta_ll[m][n] = xhi_ll[m][n]*(n[m][n]*Dll_r - B0_11[m][n]) 
beta_12[m][n] = xhi_12[m][n]*(n[m][n]*D12_r - B0_12[m][n]) 
beta_22[m][n] = xhi_22[m][n]*(n[m][n]*D22_r - B0_22[m][n]) 
next m 
next n 

End algorithm 


Figure 9.7: Pseudocode showing how to compute the beta grid. Before invocation the normals for each 
grid node must have been computed. Von Neumann boundary conditions as described in Section 9.3.7 
should be applied to the grid of current particle positions. 


9.3.3.3 The Combined Stencil 

To evaluate the complete (9.56a), we combine the S Q and Sg stencils, arriving at: 
5 C = 


0 

0 


0 

0 

0 


S a [to, n — 1] 
+Sg [to, n - 1] 

S a [to + 1, n — 1] 
+5/3 [to + l,n — 1] 

0 

to 

Be 

1 

to 

S a [to — 1, n] 

+Sg [to - 1 ,n] 

S a [to, n] 

+Sg [to, n\ 

S a [to + l,n] 

+5/3 [to + l,n] 

5/3 [to + 2, n] 

0 


S a [to, n + 1] 
+Sg [to, n + 1] 

S a [to + l,n + 1] 
+5/3 [to + l,n + 1] 

0 

0 

0 

Sg [to, n + 2] 

0 

0 


(9.65) 


This stencil includes all of (9.56a) and contains all the information needed to understand how to assemble 
the stiffness matrix K. 

9.3.3.4 Assembling the Stiffness Matrix 

The analogy to filter operations in the above derivations of the stencils can now help us to understand how 
to construct the stiffness matrix K. The result of the matrix-vector multiplication, e = K(r)r, is that e 
contains the elastic force for all the material points described by r. Unfolding the positional grid to the 
vector r can be done as shown by the pseudocode shown in Figure 9.8. A single row in K should mask 
out the needed material points in r, and apply the combined stencil S c on these particles. For the sake 
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for i = 0 to MN-1 do 
m = div(i,M) 
n = mod(i,N) 
r ( i ) = r [m, n] 
next i 


Figure 9.8: Pseudocode for unfolding positional grid. 



Figure 9.9: Row k, of the stiffness matrix K comes from unfolding the combined stencil S c . 


of clarity, let us look at how to calculate the elastic force e, for the particle r t , where position i in the 
M/V-vectors coiTesponds to index [m, n] in the grid. Unfolding the elasticity grid and the positional grid, 
the matrix-vector equation (9.57), will look like this: 


o . 
0>l 

1_ 


r k T i 


1 - 

3 • 

e* 

= 

^1 


Li 



k T 

L^mat-iJ 




(9.66) 


where k/ / is the rows of the stiffness matrix K. The row, kj, corresponding to the elastic force calculation 
for particle r,, looks like Figure 9.9. 

That is, the combined 5x5 stencil, S c , has been unwrapped and the individual rows has been placed 
in the proper positions in the corresponding row of the stiffness matrix. Note that all the other entries arc 
zero, which, in effect, ensures that only the needed particles arc used in the calculation. 

The other rows of K arc just shifted version of row k ( . This results in a stiffness matrix with a very 
nice structure, as seen in Figure 9.10. It only contains five subbands, each with five bands, with all the 
other entries being zero. So it is a square, banded, sparse matrix. This again means that we can use all 
sorts of nice matrix vector solvers to quickly calculate the elastic force of all the particles. 

In Figure 9.11 the assembly of the stiffness matrix is outlined in pseudocode. 
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Figure 9.10: Structure of the stiffness matrix K. 


Algorithm assembly-K() 

K = 0 

for i = 0 to MN-1 
m = mod(i,M) 
n = div(i,N) 
for s = m-2 to m+2 
for t = n-2 to n+2 
K(i, sM+t) = Sc(s,t) 
next t 
next s 
next i 

End algorithm 


Figure 9.11: Pseudocode for assembly of the stiffness matrix K. 
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9.3.4 A System of Ordinary Differential Equations 


Having dealt with the elastic force, we arc ready to look at the rest of the equation of motion (9.8), that is, 


11 "(“) 


dr (a, t ) 
dt 


+ T(a) 


dr(a,t ) ( 5£(r(a,t )) 
5r(a, t ) 


dt 


+ 


= f(r(a, t),t). 


(9.67) 


In the general case, the density distribution function could be dependent on time, both directly, and in¬ 
directly, through the material coordinates, as in /i(a(t), t). However, for our specific purpose, we will 
consider the density distribution function to be directly dependent upon the material coordinates only. 
This is justified by creating a fixed rectangular grid embedded in the object, where nodes correspond to a 
fixed material coordinate point. This simplifies the equation of motion to 


ri( a ) 


d 2 r(a , t) 
dt 2 


+ 7 («) 


dr(a,t ) | 5£(r(a,t )) 
5r(a, t) 


dt 


+ 


= f(r(a,t),t). 


(9.68) 


For the elastic force, we have chosen to approximate with K(r)r, where r is the MN element-sized 
concatenated vector of particle positions. To be able to use r everywhere in (9.68), we need to represent 
the discrete versions of the mass densities, p, [ m,n ], and the damping densities, 7 [m, n], as well as the 
external forces, in a proper form. 

For the discrete mass densities, p.[m,n\, we construct a diagonal MN x M TV-dimensional mass 
matrix M with the // [m, n] values in the diagonal. Similarly, we construct a damping matrix C with the 
discrete damping values 7 [m, n], in the diagonal. Finally, we create a MN- sized vector f, that gathers all 
the discrete external forces. 

With these constructs, we arc able to construct the fully spatial discretization of (9.68) as: 

M || + c| + K(r)r = f. (9.69) 

The original formulation, which is a system of nonlinear partial differential equations, has now been 
converted to a system of nonlinear ordinary differential equations. 


9.3.5 Temporal Discretization 


To simulate the deformation, the system of ordinary differential equations, (9.69), needs to be integrated 
through time. In order to favor stability and robustness over accuracy, an implicit integration scheme 
can be applied. For ease and speed, we also want the system to be linear, which is not the case for the 
system in (9.69). However, linearizing the system through a series of numerical steps will result in the 
semi-implicit method known as the Terzopoulos method [Terzopoulos et ah, 1987]. 

The time-dependent factors in (9.69) can be approximated by the discrete first- and second-order 
accurate central differences given by 


<k = h±At - r -t-At 
dt 2A t 

d 2 l = h+At ^ 2 lt + It-At 

dt 2 At 2 


(9.70a) 


(9.70b) 
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where the subscripts on r means evaluate r at the time indicated by the subscript. This results in a linear 
approximation to the nonlinear system. Now, we evaluate fat time t, and e at time t + At. That is: 


et+Af = 


(9.71) 


These evaluations, together with the discrete time approximations from (9.70a) and (9.70b), arc plugged 
into (9.69) to yield: 

ni/rlt+Ai "^-t ~h It—At . ,-ili+Ai It-Af , /o 


+ K( 

It)lt+At ~ ft’ 


(9.72) 


which can be expanded to: 


^2 M 4+Af - + ^2 M It-At + ^ Cl -t+At ~ t-At + K (lt)ft+At = t f (9-73) 


We then collect iy +Ai terms on the left-hand side, and get: 

+ 2At C ) lt + At = -t + aT2 :Ml -t ~ ( W72 M “ oa 7 C 1 -t—At - 


The left-hand side can be written as 


At 2 2A t 


A tl t +At with A t = Kfe) + . 


(9.74) 


(9.75) 


To arrive at an implicit scheme, we want the right-hand side to have a + -^yC) r t -term. We 

expand and reaiTange, and get: 


a *+a< =f< + - ( AjM _ A- C ) r,_ at 

+ + A^ Mlt ~ At? Mlt ~ At + 2At Clt ~ At - 


(9.76a) 


(9.76b) 


By adding and subtracting the term -^-Ciy, we contribute nothing to the equation. Doing this and rear¬ 
ranging again, we get 

Atlt+Af =ft + djMs + - Tmtaa, + + A-Cr, - Ay C r, (9.77a) 

=U + + A_ C£ , + Amj - Wq, - A-Mr_ at + ^Cr,_ it (9.77b) 

A -+ (^ M + #C) r, + (^M - Aj C ) r, - (^M+ Ajc) U_ AI ,9.77c) 


-£< + ( + ^ c ) b + ( -m c )^~ E ‘- A,) 
=£ > + ( AJ5 M + 2^°) U + (it M “ 5°) ZI “ E, - A,) 

A + (^ M + 2 f; c )& + 04 c K 


(9.77d) 


(9.77e) 


(9.77f) 
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Algorithm time-step(dt) 

M = mass matrix... 

C = damping matrix... 
r_cur = current position... 
r_old = old positions... 
f = external_forces... 
assemble_stiffness_matrix(K) 
MpC = ( M/(dt*dt) + C/(2*dt)) 
A = K + MpC 
MmC = (M /dt - C/2) 
v = (r_cur - r_old)/dt 
g = f + MpC*r_cur + MmC*v 
solve(A,r_new,g) 
r_old = r_cur 
r_cur = r_new 
End algorithm 


Figure 9.12: Complete pseudocode for a single time-step. 


In the last line, we recognized and substituted the implicit given velocity at time t, Yt = St (it — It-At) • 
Looking at the Si-units of the right-hand side, we find that all terms have dimension of force. Therefore, 
we can call the right-hand side for the effective force vector, g_ t . 

To summarize, we have derived a semi-implicit integration scheme described by the sparse, banded, 
lineal - system 

A tlt+At = (9-78) 


with the system matrix 


and the effective force vector 


A,=K(r t)+ (T.M + T i C|, 


1 , „ 1 


& = &+(^ m + ^ c U + (- m + t c )^ 


1 ,, 1 


At 2 


(9.79) 


(9.80) 


with 

y.t = ^ {h ~ It-At) ■ (9-81) 

Figure 9.12 shows a pseudocode version of the complete algorithm, using the pseudocode from Fig¬ 
ure 9.11. 


9.3.6 Numerical Issues 

Given initial conditions, r 0 and y 0 , the above sparse linear system can be solved by inverting the system 
matrix 

Et+At = A t Si) (9.82) 


© 
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using an appropriate numerical solver, for example Choleski decomposition or LUfactorization. However, 
the real win comes from exploiting the properties of the system matrix A from (9.79). This consists of the 
stiffness matrix plus a positive diagonal matrix, so all the properties of the stiffness matrix still hold. This 
means that we can use an iterative solver, like the conjugate gradient method as described in Section 18.2, 
to solve the system matrix. This allow us to simulate deformations, represented by a large amount of 
material coordinates, and still be able to visualize the results in real time. 

If the entries in the stiffness matrix arc very small, these values will become very large, when inverted. 
This is a problem for any method of solving the system of equations, but a simple and effective solution 
is to clamp all contributions to the stiffness matrix that arc below a certain threshold, to zero. 

Another issue concerns the boundaries of the discretized grid. Some sort of boundary conditions need 
to be implemented, to ensure that the finite difference operators do not break down. The easiest way to 
implement a boundary condition is to extend the grid with a border all around it. This border can then 
be initialized with whatever value is proper for a specific boundary condition. The filter operations arc 
then defined on all values of the original grid. This is similar - to the way filtering is implemented in, for 
example, Matlab®. For example, a natural boundary condition, also known as a von Neumann condition, 
requires the gradient to be constant across the boundary. This can be simulated by ensuring that the 
forward and cross difference operators in (9.56b) and (9.56c) return zero. To implement this, the border 
values need to be set to the same values that are right inside the boundary. 


9.3.7 Von Neumann Boundary Conditions 


We will now show how to implement simple von Neumann boundary conditions on a rectangular - grid. 

Let r[i] [j] denote the grid node at position {i. j). Let the M x N computational grid be represented on 
a grid of dimension (M + 2) x (N + 2). The computational grid, that is, where we solve the Dynamics, 
is given by the ranges i e 1..M and j e 1..N. Indices outside these ranges are used to set up boundary 
conditions. 

The central difference of r at grid locations (1, •) in horizontal direction is given by the formula: 


Du{r[i][j}) 


r [i + l][j] — r[i — l][j] 
2h\ 


(9.83) 


Obviously, when i = 1, r[i — 1 ] [j] lies outside the computational grid. To handle this boundary case, 
imagine that the surface is smooth and continuous and that the surface has infinite extension. It is then fair 
to assume that we can estimate the direction of r[i — 1] [j] by the tangent of the surface at r[i\ [j], as shown 
in Figure 9.13. That is 


r[0][j] = r[l][j] - (r[2][j] - r[l][j]) 
= 2r[l]\j}-r[2]\j}. 


(9.84) 

(9.85) 


Inserting this into the central difference scheme yields 


Dn(r[l}\j]) 


r[2][j] - (2r[l][j] — r[2][j]) 


2hi 

r[2][j] — r[l] [j] 
hi 


(9.86) 

(9.87) 
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Figure 9.13: An imaginary point to the left of the left boundary. 


This is recognized as a forward difference. Hereby we have shown that it makes sense to use Equa¬ 
tion 9.84. 

A similar case exists when i = M . This time r[i+ 1] [j] is undefined, and applying the same machinery 
suggests that 

r[M + 1] [j] = r[M}[j] - ( r[M - 1] [j] - r[M}[j]) (9.88) 

= 2r[M}\j] - r[M - l]\j], (9.89) 


In order to verify that the value is meaningful, (9.88) is inserted into the central difference formula 

(2 r[M][j] - r[M - 1 ][?]) - r[M - 1 ][j] 


D u (r[M\[j}) = 


2/ii 

r[M]\j] - r[M - 1 ][j] 

/n 


(9.90) 

(9.91) 


It is seen that this is in fact a backward difference. 

The same approach applied to the horizontal direction yields the boundary values 


r[i] [0] = 2r[i][l] - r[i\[ 2], (9.92) 

r[i][N + 1] = 2 r[i\[N] - r[i\[N - 1], (9.93) 

Notice that the four corner values are unused, that is, r[0][0], r[0][A"+l], r[M+l][0], and r[M +1][A"+1]. 
Also, no left and right values are set for the top and bottom row. Similar, no top and bottom values are set 
for the left and right borders. 

A central finite difference scheme for mixed partials can lead to trouble at the corner points because 
the values r[0] [0], r[0] [iV+l], r[M+ 1] [0], and r[M+l][iV+l] are undefined. Using the same assumption 
about a continuous smooth surface yields the computational low-cost estimations of the corner values 


r [0][0] = 2 r[l][l] — r [2] [2], (9.94) 

r [0] [N + 1] = 2r [1] [N] - r [2] [N - 1], (9.95) 

r[M + 1 ] [0] = 2 r[M][l] - r[M - 1 ][ 2 ], (9.96) 

r[M + 1 ] [N + 1] = 2 r[M\ [AT] - r[M — 1] [iV — 1 ]. (9.97) 
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Algorithm apply-neumann(r,M,N) 
for 1=0, j=l to N 

r[i-1] [j] = 2*r[i] [ j] - r[i + 1] [ j] 

for i=M, j=l to N 

r [i + 1] [j] = 2*r[i] [j] - r[i-1] [ j] 

for j=0, i=l to M 

r [i] [j-1] = 2*r [i] [ j] - r [i] [ j + 1] 

for j=N, i=l to M 

r [i] [j + 1] = 2*r[i] [ j] - r[1] [j-1] 

r [0] [0] = 2 r [1] [1] - r [2] [2] 
r[0][N+l] = 2 r;1;|N| - r[2][N-l] 
r[M+l][0] = 2 r[M][1] - r[M-l][2] 
r[M+l][N+l] = 2 r[M][N] - r[M-l][N-l] 
End algorithm 


Figure 9.14: Pseudocode showing how to apply von Neumann boundary conditions to a computational 
grid r, where a central difference scheme is applied. 


A more elaborate scheme can of course be used, but we favor the above due to its simplicity and the fact 
that corner points are based on known values and not other boundary values, set along the horizontal and 
vertical lines outside the computational grid. In Figure 9.14 the pseudocode for applying von Neumann 
boundary conditions is given. 

The boundary conditions we have applied are accurate enough to first order. Thus, if large discretiza¬ 
tions arc used (large ho and h\), then boundary values will have large errors. Furthermore, if higher-order 
difference schemes arc applied, that is, ()(n p ) with p > 1, then the scheme is more accurate than the 
boundary values. In the latter case one might want to devise a higher-order method for determining the 
off-surface points. This is, however, not needed for the Terzopoulos method, and we leave this as an 
exercise for the reader. 

9.3.8 Dirichlet Boundary Conditions 

Although not used in the Terzopoulos method, we will outline a simple implementation for Dirichlet 
boundary conditions. For this, a scalar - computational grid s is used, with the same dimension as in 
Section 9.3.7. A Dirichlet boundary condition can now be set as s [/] \j\ = 0, or any other constant value. 

These kind of boundary conditions are easily applied simply by setting values in the computational 
grid, before using it to perform any computations. 


9.4 The External Forces 

The external force, f(r,t), in our model is the sum of several different kinds of forces. These external 
forces can be used to add realistic behavior to the model. A list of some examples of such external forces 
are 


© 
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Gravity 

Gravity force is acting on everything that has a mass. It is given by 


/ gravity = /*(“) 9, (9-98) 

where g is the gravitational field. This is usually defined as a constant downward pointing value, 
g = [0, —9.82m/s 2 ,0 ] T , but could be set to any other constant value. 

Spring Forces 

Spring forces arc produced form springs between fixed material points, ao, and fixed points in the 
world, ro, 

f spring (f) = k(r 0 -r(a 0 ,t)). (9.99) 


Viscous Forces 

Viscous forces measure the resistance to motion in a fluid-like substance. For example, a viscous 
force is given as 


fviscous( a ^) =c(n(a,t) -v r (a,t))n(a,t) Va € T, (9.100) 

where c is the viscosity coefficient, the strength of the viscous force, n, is the unit surface normal, 
and v r (a, t ) is the relative surface velocity to the constant stream velocity u. That is, 

, . dr(a,t) 

v r (a, t) = u -——. (9.101) 

As an example of a viscous force, consider a ball rolling toward a puddle of water. When the ball 
rolls through the water, it will slow down, due to the viscosity of the water. 

9.4.1 Contact Forces 

Contact forces arc the key to interacting with the deformable model. Consider an impenetrable rigid 
object, for example, a stick, being pressed against the deformable object. Wherever the rigid object comes 
in contact with the deformable object, the material points of the deformable object need to be forced 
away, such that no penetrations occur. We will describe two methods to implement contact forces. One 
is by using penalty forces, as in [Terzopoulos et al., 1987]. The other is a projection method, as used in 
[Jakobsen, 2001], which utilizes the implicitly given velocity. 

Contact forces can be modeled by penalty forces, as follows. We define a potential energy function, 
V(r): 

V(r) = cexp ^^ . (9.102) 

Flere, c and e arc user-defined constants, used to fine-tune the shape of the potential. The function f(r) 
is the inside/outside function of the object. It provides a measure of the penetration depth at those places 
where an object is in contact with something else. In Figure 9.15, the potential energy function is plotted. 
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Figure 9.15: Potential energy function for c = 1 and different values of e. 


Conservative forces can be written as the negative gradient of a scalar field. Following this idea, we 
compute the negative gradient of the potential energy function: 


-W(r) 


V0(r) 

c- 

e 


exp 



(9.103) 


Finally, the contact force is determined as the projection of the negative gradient of the potential energy 
function onto the surface normal. 

/contact = - (vV • n) n. (9.104) 

By substitution, we derive the final formula for the penalty force 


contact 


r) = c 


Vf(r) 


■ exp 




n(r). 


(9.105) 


9.4.1.1 Collision Response 

The projection method is a relatively simple method for collision response : it projects material points that 
have penetrated an impenetrable object, back to the surface of the object. Since the numerical scheme 
described in Section 9.3.5 calculates the velocities of the material points implicitly, the position update 
does not result in the position and velocities becoming out of sync. If another numerical scheme is used, 
where the velocities are explicitly calculated, the synchronization of positions and velocities would be 
harder to ensure. The projection method corresponds to iterative over-relaxation. In the limiting case, it 
can be seen as infinitely stiff springs that immediately return material points to the wanted position. 

The projection method can be implemented as an extra step that is performed after all material posi¬ 
tions have been updated. Simply run through all material points and project the penetrating points back to 
the surface in the normal direction. A multiplicative constant, c > 0, can be used to control the strength 
of the projection. In this way, c can be thought of as a restitution coefficient. 
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The Finite Element Method 


In physics-based animation the Finite Element Method {FEM) is becoming the dominant method for an¬ 
imating deformable objects. The theory from mechanics is well established, but can be rather daunting. 
However, when applying a set of simplifying assumptions, the mathematics involved becomes much sim¬ 
pler. The following will concentrate on the special case of simulating deformable objects constructed from 
materials that arc isotropic and further assume that the deformations involved arc small. 

Continuum mechanics deal with the prediction and calculation of the effect of applying an external 
load to some object with physical properties. The theory of elasticity is the paid of continuum mechanics 
that deals with elastic materials, materials that return to their initial configurations, when the external load 
is removed. 

When studying the relationship between forces and deformation, some of the concepts needed arc 
stress, strain, equilibrium, and displacement. Stress is the strength of the force from interactions such as 
stretching, squeezing, or twisting, and stress is often characterized as force per unit area. Strain is the 
resulting deformation. The stress-strain relationship defines how a deformable object deforms under a 
given force. When forces arc applied to the object, it deforms to a configuration of points, in which the 
energy of the object is in equilibrium. The information needed for animation is the displacement of the 
vertices of the underlying mesh in equilibrium. 

This chapter is devoted to the basics of FEM in computer animation, and we will give the theoretical 
background for the static, isotropic stress/strain, finite element method, and we will show how FEM is used 
in animation. We will not treat fracturing [O'Brien et al., 1999, O’Brien et al., 2002a, Muller et ah, 2004a], 
cutting [Molino et al., 2004], nor diagonalization [Irving et ah, 2004] the recent finite volume method 
[Teran et ah, 2003]. We also only consider tetrahedral elements, unlike, for instance, cubical elements 
as in [Muller et ah, 2004b], 


10.1 Tetrahedral Geometry 


An often-used tessellation for surfaces is a triangular mesh. The corresponding tessellation for volumes 
is a tetrahedral mesh. These arc the building blocks of the finite element method, and therefore in the 
following we will shortly review tetrahedral geometry. 

A tetrahedron is defined by four nodes locally labeled 0, 1,2, and 3, and we will use the convention 
that the labels arc assigned such that the coordinates of nodes, xq, x\, x-2, and x :i span a positive volume, 

B-, , 335 -0 
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Figure 10.1: Geometry of a tetrahedron. 


that is 

V = i det ([xi -x 0 \x 2 -x 0 \x 3 - x 0 ]) 

Xi - x 0 X '2 - X'o at 3 - x 0 

= g yi - yo 2/2 -2/0 2/3 - yo 

Zl - 20 ^2 - A) 2:3 - 20 

> 0 , 

with | • | as the determinant operator. This is the case, when cco, ®i, and *2 form a positive cycle as seen 
from . 7 ; 3 . An example is shown in Figure 10.1. Such a tetrahedron is the element we use in the finite 
element method for 3D problems. 

To every tetrahedron we attribute a unique, global index with values e, /, ■ ■ • G {0,1, 2,..., K}. Like¬ 
wise, all nodes of a tetrahedra mesh have a unique global index with values i,j, ••• <E {0,1,2,..., AT}, 
where N < 4 K. For each tetrahedron e, the nodes also have a local index with values € 

{0,1,2,3}. The implication is that while the global index i is unique for a given node, then for each node 
indexed by i there will be a number of local indices m, n,..., corresponding to each tetrahedron e that 
share the Fth node To make matters worse, we will often need to switch between the global and local 
indices for a given node. Ironically, the finite element method appeal's to be complex, but the core is just 
work and force balance, however the indexing is often the greatest challenge. 

All nodes have a coordinate vector, and the coordinate vector of the n th node is x n = \x n , y n , z n ] T . 
These indices are illustrated in Figure 10.2. 

The finite element method considers not only the nodes of the tetrahedra, but the full continuum of 
positions inside each tetrahedron. A point inside a tetrahedron is parametrized by a set of local coordinates, 


(10.1a) 

(10.1b) 

(10.1c) 


W 0 ,Wl,W2,W S , 


(10.2) 




i 


Figure 10.2: Tetrahedra mesh notation convention. Three tetrahedral elements arc shown, with global 
indices e, /, and g. The nodes of element e have global indices i,j, k, l. 

where the value of w n is 1 at node n, zero at the remaining nodes, and a linear - function in between. 
Furthermore, 

Wq + W\ + W 2 + tt> 3 = 1, (10.3) 

and these coordinates are often called natural coordinates or Barycentric coordinates. An illustration of 
the tetrahedra natural coordinates is shown in Figure 10.3, where Figure 10.3(a) shows the point inside 
the tetrahedron, and Figure 10.3(b) shows planes of equal values of wq. Any function / linear in x, y, and 
z that takes the values f n at the nodes may be interpolated in terms of the natural coordinates using, 

3 

f(w 0 , wi,w 2 ,w 3 ) = f 0 w o + fiwi + f 2 w 2 + hw :i = ^2 f n w n . (10.4) 

n=0 

With this lineal - relationship on the four nodes of the tetrahedron, we can, for example, compute the 
coordinate of any point inside the tetrahedron in terms of the natural coordinates as, 

X(W 0 ,WI,W 2 ,W 3 ) = X 0 W 0 + XiWl + x 2 w 2 + x 3 w 3 . (10.5) 

Lineal - polynomial for interpolating a scalar field fix, y, z) at a given position p = [1, x, y, z] T inside 
the tetrahedron is given as, 

f(x, y, z) = a 0 + a\x + a 2 y + a 3 z = p T a , (10.6) 

where the a’s ai - e the polynomial coefficients (to be determined) and a = [oo, a±, a 2 , a 3 ] T . Given 
the values of /o = /(xq, yo, Z(f), .... f 3 = f(x 3 ,y 2 ,z 3 ), at the four tetrahedron corner points, xq = 
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Figure 10.3: Tetrahedra natural coordinates. In (a) is shown a general point inside a tetrahedron, and in 
(b) are shown planes of equal values of wq. 


[ xq , yo , zo ] T , • • •, *3 = [^3,2/3, z^] T we can set up the linear system 


/o 


1 X 0 Vo z 0 


ao 

fl 


1 xi yi zi 


at 

h 


1 X 2 V 2 Z 2 


<22 

h 


1 ®3 2/3 ^3 


<23 


= Pa. 


(10.7) 


The matrix P is invertible as long as the four points arc in general position , meaning that no point can be 
written as a linear combination of any of the three other points. This is the case whenever a tetrahedron is 
not collapsed into a triangle, line, or point. For four points in general position we can solve the polynomial 
coefficients as, 


a = P 


-l 


( 10 . 8 ) 


For any point p = [1, x , y , z] T we thus have, 


/( x , y , z ) = p T P~ 


(10.9) 
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Denoting the n’th column of p 1 P 1 by N n we have 


f(x,y,z) 


[JVo Ni N 2 N 3 ] 


fo 

fi 

/2 

fo 


3 

= ^ N nfn, 


n =0 


( 10 . 10 ) 


In case / is a vector field, then each coordinate can be interpolated independently. 


where 


f{x,y,z) 


\N 0 Mi N 2 



3 


= N n f n , 

n =0 


( 10 . 11 ) 


N 


n 


N n 0 0 

0 N n 0 
0 0 N n 


( 10 . 12 ) 


Let us now try to look at the barycentric coordinates, w o,..., w\ of the point [x, y , z\ T . These can 
also be used for interpolation 


f(x,y,z) 


Wofo + Wlfl + W’2/2 + W’3/3 = w T 


fo 

fl 

h 

fo 


(10.13) 


From the coordinates of the four points and the condition 1 = too + vj\ + w 2 + tv:> we can set up the linear 
system 


1' 


'1111' 


w 0 

X 


,T 0 Xl X 2 T 3 


W\ 

y 


yo yi 2/2 2/3 


w 2 

z 


_Z 0 Zl z 2 Z 3 _ 


w 3 _ 


$ 

p = Qw. 


(10.14a) 


(10.14b) 


Since the four points are in general position Q is invertible, and we can solve for w as, 


w = Q l p. 


(10.15) 
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Insertion into the barycentric interpolation formula yields 


i„\ T 


f(x,y,z ) = (Q 1 2 3 4 p ) 


p t Q t 


(10.16a) 


(10.16b) 


Comparison with the polynomial interpolation derivation we see that P = Q r , furthermore we notice 
that w n = N n in the case / is a scalar, and N n = diag (w n , w n , w n ), when f is a 3D vector. Hence, not 
surprisingly, it is seen that barycentric interpolation is just linear - polynomial interpolation. 


10.2 Elastic Solids with Finite Element Models 

The finite element method is a procedure for obtaining numerical approximations to the solution of bound¬ 
ary value problems. The general idea is to formulate a deformation of a solid as the matrix equation, 

Ku = f, (10.17) 

where K is a symmetric, positive definite, stiffness matrix , u is a vector of nodal displacements, and / 
is a vector of external node forces. The goal is to solve for the value of u. To simplify the problem, 
the entire computational domain is replaced by the union of disjoint subdomains called finite elements. 
These subdomains are tetrahedra in a volume mesh. Other volume types may be used, but the tetrahedral 
geometry is often the computationally most feasible for computer animation. 

In computer animations, the Three-Dimensional Linear Elastostatics Model is typically used and will 
be described in the rest of this chapter. It uses the following assumptions: 

1. The deformations involved are small, in which case the relationship between stress and strain are 
well approximated by a linear model. 

2. Only elastic materials are considered, thereby ignoring the viscous stress. 

3. Only isotropic materials are considered, which implies that elasticity matrix can be described by 
two parameters only. 

4. The object is in static equilibrium, which allows us to ignore inertia forces. 

In the following, we will derive the three-dimensional linear elastostatics model for computer animation. 
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Figure 10.4: Deformation of a tetrahedron is given as a set of displacement vectors. 


10.2.1 Displacement and Shape Functions 

The displacement function at a given point x = [ x , y, z] T in the object, is given as 


u = 


u 

u y 

u z 


= X — X. 


(10.18) 


where u x is the displacement along the .r-axis, u y is the displacement along the y-axis, and u z is the 
displacement along the z-axis. Thus, every point x u in the undeformed object corresponds to point x = 
x u + u in the deformed object. This is illustrated in Figure 10.4. Here the traditional finite element 
literature adds a further level of complexity by considering material properties, which both x and x „ 
depend on. However, we will not need these for the deformations we will discuss in this chapter. 

The deformable object is discretized into tetrahedral elements, where each node has a deformation 
u n = x n — x vn . A lineal - approximation to the displacement function inside an element e from its nodal 
values is given as, 


u(w 0 ,wi,w 2 ,w 3 ) 


[«0 I «1 


u 2 I U 3 ] 


w 0 

W\ 

VJ2 

U>3 


u 0 

< 

u b 


uf 

wi 


ut 


u y 2 

U Z 2 


U' 


x 

3 

V 

U 3 

Un 


1 

U) 0 


W\ 


W 2 

- 

_W 3 _ 


(10.19a) 


(10.19b) 


Thus, the natural coordinates w n form a finite dimensional basis for the space spanned by the tetrahedron. 
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It is often convenient in an implementation to have the nodal displacements arranged node-wise as a 
single vector. For the element e, the node displacement vector u e is given as: 


u 0 

Ui 

U 2 

U:i_ 

= [uq Uq Uq uf u\ u\ U3 ] T . 


Hence, using 


N r 


w n 0 0 

0 w n 0 
0 0 w n 


we may now rewrite the linear - interpolation in (10.19a) as, 


'LL = [No Ni N2 
3 

— ^ ' N n u n 

n= 0 

= Nu e , 


u 0 


N 3 


u 1 
U2 




( 10 . 20 a) 

(10.20b) 

( 10 . 21 ) 


( 10 . 22 a) 

(10.22b) 

(10.22c) 


where N = [ATq N 3 TV 2 TV 3 , and where the submatriccs TV,, arc called the shape fimctions. 


10.2.2 Strain 


To describe the deformation of an object, a measurement is needed. We use the linear Cauchy strain 
matrix , defined in Section 22.7, which relates the displacement u to the strain, e. In 3D the strain matrix 
is given as 


where 



£11 

£12 

£13 

£ = 

£21 

£22 

£23 


_£;ti 

£32 

£33_ 

£ ij = 

K 

dui 

dxj 

duj 

dxi 


(10.23) 


(10.24) 
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and where x\ = x, X 2 = y, and x '3 = z has been introduced for notational convenience. The matrix e is 
symmetric, and its six independent components may conveniently be rewritten on vector form as, 


£ = [ett £22 £33 712 713 

r dui 
dxi 
du,2 
8x2 

dus 

8x3 

du\ 1 du2 
8x2 ' dxi 
8u 1 _i_ &U3 
8x3 'dxi 
du2 1 du3 
-8x 3 ' 8x2- 


r 8 

dxi 

0 

0 

8 

8x2 

8 

8x3 

0 


0 

8 

8 x 2 

0 

a 

dxi 

0 

a 

8x 3 


0 ' 

0 

8 

8x 3 

0 

8 

dxi 

8 

8x2- 


U 1 
U2 
U 3 


= Su. 



(10.25a) 


(10.25b) 


(10.25c) 


(10.25d) 


where 7 ^ = 2e t] . The matrix S is an operator si mi lar to the gradient. The first three components of the 
strain measures stretching in the coordinate directions, while the last three components measure shearing. 

A lineal - interpolation of the relation between strains and nodal displacements inside a finite element 
may be obtained by combining (10.25) with the linear approximation from (10.22): 


e = SNu e = Bu e , 


(10.26) 


where the matrix B = SN. Naturally B has a block structure si mi lar to N, i.e., 


B = S [iVo JVi N 2 -ZV 3 = Bq B 1 B 2 B 3 


(10.27) 
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where B n = SN n for n = 0,1, 2,3. Straightforward evaluation gives, 


B 


n 


r 8 

dxi 

0 

0 

d 

8x2 

8 

8x3 

_ 0 

bn 

0 

0 

dn 

0 


0 

0 ' 

8 

0 

dx 2 

0 

8 

dx 3 

8 

0 

dxi 

0 

8 

dxi 

8 

8 

dx 3 

dx 2- 


0 0 

c n 0 
0 d n 
b n 0 
0 b n 
dn Cn 


W n 0 0 

0 W n 0 

0 0 w n 


(10.28a) 


(10.28b) 


where = 


dw n „ _ 8w n „ nf i j _ 8w n 
dxi ’ — 8x2 ’ dMU Un ~ 8x3 ‘ 

When computing the stiffness matrix we arc interested in derivatives of the N n ’ s with respect to the 
x-, y-, and 2 - coordinates. From (10.9) we see 


dNn 

dx 

dNn 

dy 

dNn 

dz 


= P 


-1 

1 ,n’ 


= p 


2,n’ 


— P 1 

~~ -^ 3 ,n» 


(10.29a) 

(10.29b) 

(10.29c) 


where we have used zero-indexing. Instead of actually computing the inverse of the 4x4 P matrix, a 
computationally more efficient solution exists, which only requires us to solve for a 3 x 3 system. This is 
achieved using the following theorem. 

Theorem 10.1 

Let x n = [x n , y n i z n ] T for n = 0,1, 2, 3, and define 


eio = *1 — *0; 

^20 = *2 - X 0 , 
e 30 = X 3 — Xq. 


Furthermore, define the matrix P as, 

[1 ..'1 

1 xf 
1 xl 
1 x\_ 


© 


(10.30a) 

(10.30b) 

(10.30c) 


(10.31) 
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and the matrix E as, 



eio 


I"(xi - x 0 ) (yi - 2 /q ) (zi - z 0 )l 

E = 

e20 

= 

1 

CN 

1 

CN 

3 

o 

« 

1 

CN 

A 


^30 


(x 3 - x 0 ) ( 2/3 - 2 / 0 ) (^3 - A))l 


Then the matrices P and E are related according to, 


det P 

pi 

1 n+l,m+l 


= det E, 
= E~ l 

n,m * 


(10.32) 


(10.33a) 

(10.33b) 


Proof of Theorem 10.1: 

It is simple to show that det P 
we use Cramer’s rule. 


det E, and this is left as an exercise for the reader. To prove (10.33b) 


P 


-l 

n,m 


(—l) n+m det P mn 
det P 


(10.34) 


where det P rnn is the determinant of P with the m’th row and n’th column removed. By straightforward 
computation we verify that 

_^(n+l)+(m+l) _ _(10.35) 


and what remains to be proven is that 


det P n+ i >m+ i — det E nm , 


(10.36) 


A total of 9 cases exist for this, but here we will only show a single case and leave the remaining cases for 
the reader. Setting n = 1 and m = 0 implies that 


det P 12 = det E qi. 


or equivalently that 


which is trivially true. 


det 


'1 

x 0 

Zo 


r 




1 

X2 

Z2 

= det 

X2 

- x 0 

Z2 

- Zo 

1 


x 3 

- X 0 

Z3 

- zo_ 

X3 

Z 3_ 







Thus we have 



As can be seen from (10.29), for n > 0 we have, 


dNn 

dx 

dNn 

dy 

dNn 

dz 


E 


-l 

0 ,n—l! 


E 


-l 

l,n—1 ’ 


E 


-i 

2,ra—I' 


(10.37) 

(10.38) 

□ 

(10.39) 

(10.40a) 

(10.40b) 

(10.40c) 
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Figure 10.5: The projection of surface element dS onto the x — z plane. 


For n = 0 we cannot use the above equations, since the first column of P 1 is missing. Instead we use 
the normalization of barycentric coordinates, 


Wp = 1 — W\ — W2 — Wo 

Taking the derivate of w n w.r.t. x, y, and z yields 

dN 0 

dx 
dN 0 
dy 
dN 0 
dz 


(10.41) 


The notation b n = c n 
four 5-vectors, where 


dN n 

dy 


— — FT 1 — wr 1 

-^00 -^01 -^02 » 

(10.42a) 

— Er 1 — — Er 1 

^10 -^n ^12 J 

(10.42b) 

— Er 1 — E — Er 1 
- C/ 20 ■ C/ 21 -^22 • 

(10.42c) 

djX. j s used. Furthermore all the derivatives arc returned as 


B r 


[b r 


(10.43) 


10.2.3 Stress 

Stress is defined as force per unit area, the standard unit for stress is the same as the unit for pressure, that 
is pascal (Pa). Start by considering a surface element dS, which may be represented in terms of the unit 
surface elements as [dS x , dS y , dS z ] T , where dS y is the area projection onto the x — z plane as illustrated 
in Figure 10.5. If the force df x is applied along the x-axis of the material surface dS y orthogonal to the x 
and 2 axes, then the shear stress, a xy , is 

_ df x 

aXy ~ dSy 


(10.44) 
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Similar the normal stress, a 


yy 


is 


a yy ~ 


dfy 

dS„ 


(10.45) 


Consider all forces along all coordinate axes df x , df y , and df z applied to all surface directions dS x , dS y 
and dS z . These forces yield what is known as Cauchy’s stress hypothesis [Lautrup, 2005]: 


dfx 


*3 xx dSx + (TxydSy + <3xzdS z 

dfy 

= 

OyxdSx + CyydSy (Jy Z dS Z 

_df z _ 


CTzxdSx + (3 Z ydSy + cr zz dS z 


(10.46) 


where each coefficient a tJ depends on position and time. We can collect the coefficients into a stress 
matrix a: 



&XX 

®xy 

&XZ 

a = 

(3yx 

a yy 

a yz 


O zx 

&zy 

G zz 


(10.47) 


A very general condition that is imposed on the stress matrix is symmetry, which means 


&ij — O’ji- 


(10.48) 


This reduces the stress matrix from 9 components to only 6. We thus write it in vector form as 




CTr 


a 


yy 


o' 2 


<7 


xy 


OV 


<7 


zy i 


(10.49) 


The proofs of Cauchy’s stress hypothesis and symmetry of the stress matrix can be found in [Lautrup, 2005, 
Chapter 9]. The six independent stress components may then be related to the six corresponding strains e 
by the linear - matrix equation 

a = De. (10.50) 


This equation is Hooke’s Spring Law generalized to elastic solids, which can help in the interpretation. 
Think of stress as a sort of spring force, and strain as a sort of spring elongation. 

The isotropic elasticity matrix D, described in Section 22.7, is defined as 



~l-v 

V 

V 

0 

0 

0 


V 

1 - V 

V 

0 

0 

0 

Y 

V 

V 

1 - V 

0 

0 

0 

(l + i/)(l-2z/) 

0 

0 

0 

l—2u 

2 

0 

0 


0 

0 

0 

0 

1 — 2 ^ 

2 

0 


0 

0 

0 

0 

0 

l—2u 
2 . 


(10.51) 


where Y is Young’s modulus. Young’s modulus is a measure of instretchability, such that a large Young’s 
modulus implies that the material is har'd to stretch. Some typical values arc 200GPa for steel, 0.05GPa for 
rubber, and 0.032GPa for cork. The coefficient — 1 < u < 1 /2 is the Poisson’s ratio, and it is only in rare 
instances negative. The Poisson’s ratio describes how much a material contracts in the direction transverse 
to stretching. Negative values imply that the material gets thicker when stretched. Typical values arc 1/2 
for incompressible materials such as rubber, 0.33 for steel, and almost 0 for cork. 
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10.2.4 Balancing Work 


In the following will we derive the stiffness matrix equation for a single tetrahedron. 

Stress, discussed in Section 10.2.3, is the governing force internally in a solid material. External forces 
arc applied either to the nodes or to the tetrahedral surfaces. The forces acting at the nodes of the e’th 
tetrahedral element is written as 


9o 



93 . 


(10.52) 


and nodal forces come from the neighboring tetrahedra inside a mesh. External distributed load on the 
boundary of the tetrahedra, such as an instrument pressing against an organ, is called traction, t, and has 
unit force per area. In short, the following forces arc present for the isolated system consisting of a single 
tetrahedron: 


• Stress forces arc internal forces 


• Nodal forces arc external forces 


• Load forces arc external forces 

Assume that we arc given a virtual displacement of the nodes of tetrahedron e termed 5u e , such that 
every node is moved to a new displacement, ( u e )' = u e + 5u e . During the virtual displacement work is 
performed in the system. The corresponding displacement for every point inside the tetrahedron may be 
obtained using our interpolation equation, and the result may be used to calculate the strain and stress for 
every point inside the tetrahedron, using (10.22) and (10.53b). Hence, we find, 

6u = N5u e , (10.53a) 

6e = BSu e , (10.53b) 

However, our goal is the opposite: given a set of forces, what is the resulting displacement. Our strategy 
will be to use the work balance equation, since the amount of work performed by the surface of a tetrahe¬ 
dron must equal the accumulation of work done by its interior. This is also known as static equilibrium , 
and for our system of nodal, surface, and volume forces it is given as, 


W a dV = W a + 






W t dA , 


(10.54) 


where (V and diY arc points inside and on the surface of the tetrahedron respectively, 
known as the Work Balance Equation. The internal work per unit volume performed by 
is given as, 


W a = ( 5e) T cr , 


This equation is 
the stress forces 

(10.55) 


the external work performed by the nodal forces, q e is given as, 


W q = (Su e ) T q e , 


(10.56) 
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and the external work per unit area performed by the external distributed load is given as, 

W t = ( 5uft. 

Substituting (10.55), (10.56), and (10.57) into the work balance equation (10.54), we find 


(. Se ) 7 <j dV = (5u e ) T q e + / (5u) T t dA. 




isae 


Substituting by (10.53a) and (10.53b) gives 


(10.57) 


(10.58) 


(. B5u e ) T a dV = ( Su e ) T q e + / ( N5u e ) T t dA. 


>sn e 


(10.59) 


Expanding the transpose operator, and moving the constant virtual displacement outside the integrals 
gives, 


(< Su e ) T j B t ct dV = ( du e ) T ^ q e 


+ 


N T t dA 


/<5Q e 


(10.60) 


Since this equation must hold for any virtual displacement 5u e , we conclude that the multipliers must be 
equal, i.e., 


B T a dV = q e + 


N 1 1 dA. 


(10.61) 


JVL e J 5fl e 

Although appealing complex, the terms under the integral are simple to evaluate. To achieve this, we first 
substitute (10.50) followed by (10.26) to get. 


B r DedV = q e + / N T tdA 




/<5Q e 


4 


B 1 DBu e dV = q e + / N T t dA. 




isae 


(10.62a) 

(10.62b) 


The matrices B and D arc constant w.r.t. the integration as arc the nodal displacements u e , hence we find 
that 


B T DBu e [ dV = q e + I N T t dA 

J r2 e J 5r2 e 

(10.63a) 



B T DBu e V e = q e + [ N T t dA, 

(10.63b) 


where V e is the volume of the tetrahedron respectively. Introducing the element stiffness matrix 


K e = B t DBV' 


(10.64) 
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and the element surface force, 

f e =l NH dA, 


(10.65) 


we finally have the matrix equation, 

K e u e = q e + f e , (10.66) 

where K e G M 12x12 is the element stiffness matrix, u e G M 12 is the nodal displacements, q e E M 12 is the 
nodal forces, and f e G M 12 is the element force. 


10.2.5 Assembling the Work Balance Equation for all Tetrahedra 


In the previous section we saw that the work balance for a single tetrahedron resulted in a simple matrix 
equation in terms of the nodal forces and displacements. We have such an equation for each tetrahedron 
in a mesh, and these equations will be tied together in this section. 

Newton’s third law dictates that in static equilibrium all forces at a point must sum to zero. Thus, for 
a given node i, all the nodal forces coming from neighboring tetrahedra must sum to zero. Assume that 
arc the local indices in elements e, /,... that correspond to the global index i, then the force 
balance equation may be written as, 

<Zn+ <£ + ••'= 0. (10.67) 

To obtain a computationally more efficient form, we first investigate the structure of K e . It is defined as 
K e = B T DBV e , and since 

B = [Bq B i Bo B 3 ,] (10.68) 

we can decompose K e into submatrices 


-BT 

Bj 

Bl 

[bI\ 

D \Bq 

B\ Bo B 

3] w 5 


Bq DB 0 V e 

BqDBiV £ 

B^DB 2 V £ 

BqDB 3 V £ 

B^DB 0 V e 

BjDBiV e 

BjDB 2 V e 

BlDB 3 V £ 

BlDB 0 V e 

B%DBiV e 

B 2 DB 2 V e 

B\DBfV £ 

BjDB 0 V e 

BlDBiV e 

BjDB 2 V e 

bIdb 3 v £ 


Using the notation K e nm = Bf L DB m V e we simplify the above to 


(10.69a) 


(10.69b) 


TS< 

00 

-"■01 

K e 0 2 

K e 03 

10 

IK 11 

K \2 

>5 

CO 

-^20 

zc< 

/v 21 

Kl 2 

23 

If 1 
. 30 

^31 

K% 2 

TS< 

33 . 


(10.70) 


The matrix K' has a special symmetry for isotropic materials: 

(■ BlDB m V e ) T = B T m DB n V e 


(10.71) 
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since D is symmetric for isotropic materials. Hence the submatrices arc related as, 

(K e \ r = K 

\ IJ) 

which is conveniently used in calculating K e . Furthermore, writing the elasticity matrix D as, 

'D 0 Dx Dx 0 0 O' 

Dx D 0 Dx 0 0 0 

Dx Dx Do 0 0 0 

0 0 0 £> 2 0 0 ’ 

0 0 0 0 D 2 0 

o o o o o d 2 


(10.72) 


(10.73) 


where Do = 


_ Y(l-u) 


(l+z/)(l-2z/) 


,D x = 


(l+z/)(l-2 v) 


. and D i = 2 (i+ ( J)(i^L) » and we evaluate K e nm 


K e = 

*■ nm 


Dq b n bm T ^2^0 nCm T £ 
D\c n b 

m + D2b n C rr 

djibfji + D2b n d ri 


D\bnCm + D2C n b 

m 

Dq C n Cm + D2(b n b rn T 

Did n Cm -\- D2C n d rn 


(10.74a) 


Dibndr, 

D i c n d-n 


D2d n b ri 
D 2 dn c n 


Dodndm -|- D2(b n b r} 


(10.74b) 


The nodal force coming from the e’th element onto the i’th node in global index and n’th node in local 
e index, we have that 


Q e n = K e n0 u e 0 + Klxu\ + K e n2 u e 2 + Kftu% - ft n , 


(10.75) 


where is the displacement of the n’th node in element e. The force balance on the i’th node in global 
indices corresponding to n, ... in local e, ... element indices. Thus, 


0 = X] Kloul + Klxu\ + K e n2 u e 2 + K^u% - ft, 


(10.76) 


where % is the set of tetrahedra including node with global index i and corresponding local index n. 

We will now show how to assemble all the force equations into a single and easily invertible system 
of equations. The assembly is best explained through examples. Consider the 2-tetrahedra mesh shown in 
Figure 10.6. The two tetrahedra e and / share a node, l, and therefore only have one coupling. The node 
l has local index 3 and 0 in tetrahedra e and / respectively. Hence we have one force balance equation, 
which sums over e and /, 

0 = K% 0 ul + K\ x u\ + + K\ 3 u\ - ft 

+ Kq 0 Uq + K^ m u{ + Kq 2 u 2 + Kq 3 u^ — /g. 


(10.77) 
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2 ,n J-,j 



Figure 10.6: A mesh consisting of two tetrahedra coupled at node l. 


To simplify the above equation we wish to write it on the form. 


Ku = f, 


(10.78) 


where K is the assembly of all the tetrahedral stiffness matrices, u is a vector of all the concatenated 
displacements, and f is likewise an assembly of external forces. The first step to achieve this is to identify 
the local nodal indices with their global counterparts as u t = u' (] , uj = u\, ..., and thus we write, 


0 = K^Uj + K\iUj + K^2 u k + -f^33 u l — f 3 
T -K- 00 u l -K -01 u m. "F -£^02tin T -^03^0 — 

Then we collect terms in front of the nodal displacements to get. 


Tf e 

30 


31 


TS P 

32 


(Kf i:i + K{ )0 ) K f 01 K{ 2 K 


This has the form of (10.78) with 


03 


Ui 

U j 

U k 

Ul 

Um. 

U n 

U n 


- (ft + fo) 


= 0 


K = 


K to K 


31 


U 




Ui Uj 


f = n+n- 


K 


32 


Uk 


(K\ 


33 + K 

Un U, 


f ) 

ool 


-^oi 


Kf 


Kf 

-^03 


(10.79) 


(10.80) 


(10.81a) 

(10.81b) 

(10.81c) 


© 


© 


Ul u m 


© 
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Figure 10.7: A mesh consisting of two tetrahedra coupled at nodes I. 


A more complicated configuration is shown in Figure 10.7, where a mesh is shown consisting of two 
tetrahedra sharing a face, which is therefore coupled at the nodes i, j, and l, which have local coordinates 
0,1, 3 and 0, 2, 3 in elements e and / respectively. Hence we have three force balance equations, which 
sums over e and /, 


0 = + K% x u\ + K% 2 u\ + K% 3 u% 

0 = K e 10 u e 0 + K\ x u\ + K\ 2 u\ + K\ 3 u\ 
0 = K e 30 u e 0 + K% lU \ + K\ 2 u\ + K\ 3 u\ 


fo + KlA + + Kq 2 u 2 + ^03 U 3 fl> 

(10.82a) 

fi + K f 20 u f 0 + K f 21 u{ + K f 22 u{ + K 23 u 3 - f f 2 , 

(10.82b) 

f 3 + K 30 Uq + K 31 u{ + K 32 u 2 + K 33 u 3 — f 3 , 

(10.82c) 


Again we identify the global displacement with the local and write, 


0 — KqqU{ + K e 01 Uj + KQ 2 Uk + Kq 3 ui — f q + Kq 0 Ui + K 31 u m + K^ 2 Uj + Kq 3 ui — /q, 

(10.83a) 

0 = K\ 3 Ui + K\iUj + Kl 2 iik + K\ 3 ui — + K 20 U{ + K 21 u m + K 22 uj + K 23 ui — 

(10.83b) 

0 — K 3 Qiii -F K 31 Uj + K 32 Uk 4 “ K 33 U[ f 3 + K 3 qU{ + K 33 u m -F -^32"F ^33 f 31 

(10.83c) 
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I< = 0 
/ = 0 

for all tetrahedra e do 
for each node m of e do 
find global index i of m 
for each node n of e do 
find global index j of n 
Kj,i+ = K'-. rn 
next n 

fi+ = fa 

next m 
next e 


Figure 10.8: Pseudocode for the assembly of the global stiffness matrix K and the global force vector f. 


Finally we gather the equations on the form of (10.78) and find. 


0 = 


K e 
■*'■ 00 

TS< 

10 


K 


30 


+ k’„, 
+ k’ w 
+ K‘ m 


Thus we identify. 


^01 

Kh 

K%i 


+ Kq2 
+ K f 22 
+ K{ 2 


K e 

A 02 


■12 


K e 03 

K h 


Kh K 


33 




Kf 

- rv oi 

21 

Kf 

31 


Ui 

U j 

Uk 

Ul 

ZLm. 


n + rt 
n + H 
n+ft 


(10.84) 


K = 


K e 00 

Kh 


K 


30 


+ K S z o 


u = 


Ui 

Uj 

Uk 

Ul 

Urn. 


f = 


fo + f 0 

fi+rt 

f 3 + fl 


K e 01 


■ li 


K 


31 


+ K f 0 2 

+ k{ 2 

+ K {2 


TC e 

02 


■12 


Kh 

Kh 


Kh K 


33 


+ k’ 23 
+ k{ 3 


Kf 

K? 

■“■ 21 

Kf 

-"■31 


(10.85a) 


(10.85b) 


(10.85c) 


The general assembly is shown as pseudocode in Figure 10.8. The assembly will assemble all the 
local stiffness matrices into one global stiffness matrix, such that 


Ku = f 


( 10 . 86 ) 


where 


u = [u 0 ,ui,u 2 ,.. ■] 


(10.87) 


© 


© 


© 
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and 

f = (10-88) 

In computer animation forces arc often set directly on nodes through the global /-vector instead of indi¬ 
rectly through local surface traction, t, in which case the computation of f f m is omitted in the pseudocode. 

Deformation of tetrahedral meshes can now be calculated by applying the external forces, calculating 
the global stiffness matrix, and solving the linear - system, 

u = K~ 1 f. (10.89) 

This system may be solved efficiently, for example, using the conjugate gradient method, but the result is 
only static deformations. In the following we will extend the method to dynamic animation. 

Using a global stiffness matrix is not the most efficient approach, and a computationally more efficient 
approach will be shown later. 

10.2.6 Inertia and Damping Forces 

In this section we will show how to extend the equations of motion to include inertia and damping forces. 
This is done straightforwardly by adding the corresponding work-energy terms to the derivation in Sec¬ 
tion 10.2.4. 

Inertia forces originate from the movement of inertia, i.e., inertia forces are mass times acceleration. 
For a virtual displacement, 5u, the work performed by the inertia force per unit volume is, 

Wmertia = Su T pit, (10.90) 

where p is the mass density and it is the double time derivative of u. Likewise, for a linear - damping force 
per unit volume, the work performed is 

(Fdamping — du cil (10.91) 

where c is the damping coefficient and it is the time derivative of u. To include inertia and damping forces 
we extend the work balance equation (10.54) as, 

[ Wi nertia dV + [ fU damping dV+ f W a dV = W q + f W t dA, (10.92) 

Jfi e Jci e Jsfi e 

The values of Su may be interpolated from the nodal values, 6u = N5u e , and assuming that we also 
have access to time derivatives at the nodal values, we may equally have it = Nu e and it = Nu e . Thus, 
as we expand the two new terms in (10.92) we find that 

[ 5u T pu dV + [ 5u t cu dV + [ W a dV = W q + f W t dA, (10.93a) 

Jfl e J Q e JCl e J5fl e 

f (N5u e ) T pNu e dV + [ (N5u e ) T cNii e dV + [ W a dV = W q + [ W t dA. 

Jsci e 


(10.93b) 
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The displacements 5u e and u e arc constant w.r.t. the integration, and they may be moved outside the 
integration. Further introducing the element mass matrix. 


and the element damping matrix. 


M e = / N T Np dV, 


C e = N 1 Nc dV, 

Jci e 


and expanding the remaining terms as described in Section 10.2.4, we find 

(du e ) T M e ii e + (< 5u e ) T C e ii e + (du e ) T K e u e = ( du e ) T q e + ( 5u e f f e , 


4 


(< 5u e ) T ( M e u e + C e u e + K e u e ) = ( 5u e Y ( q e + f e ). 


e\T t 


(10.94) 


(10.95) 


(10.96a) 

(10.96b) 


Again, since this is valid for any virtual displacement 5u e , we conclude that 

M e ii e + C e u e + K e u e = q e + f e , (10.97) 

Since u e = x e — x^, then u e = x e and u e = x e . Thus, 

M e x e + C e x e + K e (x e - x e u ) = f; n (10.98) 

where we have used the shorthand, f' CM = q e + f e . Assembling the total system into a single matrix 
equation we have, 

Mx + Cx + K(x — x u ) = / ext (10.99) 


10.2.7 Computing the Mass Matrix 


A mass matrix is a discrete representation of a continuous mass distribution. The element mass matrix is 
defined as 

M e = [ N T NpdV .; (10.100) 

and is called consistent. Considering the scalar case where N n G M, and assuming that p is constant over 
each tetrahedral element and using the linear shape functions, the above definition (10.100) results in 


M e = p 



(N 0 N 0 ) (iVoiVi) (N 0 N 2 ) (N 0 N 3 ) 
(NiNq) (N! Ni) (MriV 2 ) (JVi N 3 ) 
(N 2 N 0 ) (N 2 Ni) (n 2 n 2 ) (N 2 N 3 ) 
(N 3 Nq) (N 3 Ni) (N 3 N 2 ) (N 3 N 3 ) 


dV. 


( 10 . 101 ) 


Using [Cook et al., 2002, pp. 266], 



NqN\N^N 3 dV 


6 V e a\b\c\d\ 

(3 + a + b + c + d)V 


( 10 . 102 ) 
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we find that 


or equivalently 


M e = p 


y e 

20 


1 1 1 
2 1 1 


1 1 
1 1 


M ij ~ P^(l + <%)> 


(10.103) 


(10.104) 


where d tJ is the Kroncckcr delta. In the vector case where N n G M 3x3 , this means that for the tetrahedral 
element 


' 2 2 2 1 
2 2 2 1 
2 2 2 1 


1111 

1111 

1111 


1111 " 

1111 

1111 


M e = p 


20 


1 1 
1 1 
1 1 

1 1 
1 1 
1 1 


1 2 
1 2 
1 2 

1 1 
1 1 
1 1 


2 2 
2 2 
2 2 

1 1 
1 1 
1 1 


1 1 
1 1 
1 1 

2 2 
2 2 
2 2 


1 1 
1 1 
1 1 

2 1 
2 1 
2 1 


1 1 
1 1 
1 1 

1 1 
1 1 
1 1 


1 1 
1 1 
1 1 


1 1 
1 1 
1 1 


1 1 
1 1 
1 1 


1 1 
1 1 
1 1 


1 2 
1 2 
1 2 


2 2 
2 2 
2 2 


(10.105) 


Notice that in order to obtain the global/system mass matrix an assembly similar to the stiffness matrix 
assembly must be carried out. Further, the global M matrix will have the same subblock pattern as the 
global K matrix. 

A consistent mass matrix is often not used in computer graphics. Instead an ad hoc matrix called the 
lumped mass matrix is used, which is obtained by placing particle masses at the nodes. This corresponds 
to shifting all the masses in the rows of (10.103) onto the diagonal. In 3D this yields the element mass 
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for each node n 
mass(n) =0 
next n 

for each tetrahedron e 
for each node n of e 

ye 

mass (n) += p— 
next n 
next e 


Figure 10.9: Pseudocode for calculating the lumped mass matrix. 


matrix 


' 1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 



0 

0 

0 
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0 

0 

0 

0 

0 

0 

0 

0 



0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 


Ct> 

II 

rfU 'n 

1 Ct> 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 


0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 



0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 



0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 



0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 



0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 



. 0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 


Thus, a lumped mass matrix is diagonal whereas a consistent mass matrix is not. Observe that the con¬ 
sequence is that the global mass matrix also becomes diagonal, and thus the assembly simply becomes 
an iteration over all tetrahedra, while incrementing the nodal mass by one fourth of the tetrahedral mass. 
This is demonstrated in Figure 10.9. Since V and p arc positive for all elements, both the element mass 
matrices and the global mass matrices arc symmetric positive definite matrices. 

The advantage of lumping is less storage and higher performance. The disadvantage is that discon¬ 
tinuity is introduced in the displacement field. [O'Brien et al., 2002b] state that the errors in lumping is 
negligible for small-size, course meshes such as those used in computer graphics. There does exist alter¬ 
native approaches for computing mass matrices; we refer the interested reader to [Cook et al., 2002] for 
more details. 


10.3 Stiffness Warping 

The lineal - elastic method introduced in the previous sections is only valid for small deformations. This is 
due to the fact that the linear Cauchy strain tensor e used in the calculation of the stiffness matrix K is 


© 


© 
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Figure 10.10: Illustration of the stiffness waiping concept 


a first-order approximation to the nonlinear Green-Lagrange strain tensor, as explained in Section 22.7. 
When the deformable object is subject to large rotational deformations, then the result is an unrealistic 
growth in volume. However, it is not a viable solution to use a higher-order approximation to the strain 
tensor for animation purposes, since this leads to systems of nonlinear equations, which are hard to solve. 

In [Muller et al., 2004a], a method was shown that removes the artifacts that linear elastic forces in¬ 
troduces, while keeping the governing equations linear. Consider the elastic force acting on a single 
tetrahedral element e, as described in (10.89) and repeated here for convenience 

f e = K e {x e - x e u ) = K e x e + fl, (10.107) 

where u e = x e — x^, and where x e and x f u are the positions of the four vertices of a tetrahedron at its 
deformed and undeformed state, K' is the element stiffness matrix, and f e u = K f xi a is the force offsets. 

Assume that the rotational part of the deformation R e for the tetrahedron e is known. The stiffness 
warping concept consists of rotating the deformed positions back into the undeformed frame, calculating 
the elastic force in this frame, and rotating the result back to the deformed frame. This is illustrated in 
Figure 10.10, and can be written as 


f e = R e K e (R e ~ l x e - x e u ) (10.108a) 

= R e K e R el x e - R e K e x e u (10.108b) 

= K e, x e + f e J, (10.108c) 

where R e is a 12 x 12 matrix that contains the 3 x 3 rotation matrix along its diagonal. By doing so, the 
elastic forces are the same as if the regular linear forces were calculated in a rotated coordinate frame. 

Stiffness waiping is not without peril, since it does suffer from element inversion, when a model is 
compressed violently. Recently a diagonalization method has been introduced to deal with the inversion 
problem [Irving et al., 2004], but this method is outside the scope of this book. 
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10.3.1 Stiffness Warping Assembly 

For an assembled system, the linear elastostatic model is, 

Ku = K(x - x u ) = f, 


(10.109) 


where u = x — x u is the nodal displacement, x is the current position and x u is the original position, f 
is the nodal forces, and K is the stiffness matrix. Using f u = —Kx u as the force offset vectors, we have 


f = Kx + f u . (10.110) 

Let R denote the current orientation, then R 1 rotates back to the original frame, which means we have, 


/ = RK(R 1 x - x u ), 
f = RKR x x — RKx u . 


So with stiffness warping we have to compute 


K' = RKR _1 , 
fu = -RKx u . 


(10.111a) 

( 10 . 111 b) 


(10.112a) 

( 10 . 112 b) 


For N nodes the system stiffness matrix K' is a 3 N x 3 N symmetric and sparse matrix. It would be 
insane to actually allocate such a matrix; instead the matrix is stored inside the nodes of the volume mesh. 

Each node stores a row of K' and the corresponding entries of the f' u vector. That is, the i’th node 
stores all nonzero 3x3 subblocks of the i’th row of the stiffness matrix, and it also stores the i’th 3D 
subvector of f u . Let % denote the set of all tetrahedral elements sharing global node i, then the assembly 
of the warped stiffness matrix can be written as, 


K 'ij = Y, R e K e nm R e 

i~n and j~m 


Furthermore, the assembly of the force offset vector 


f : i = E - KK - 


(10.113) 


(10.114) 


Notice that this can be optimized: letting the indices i, j, k, and rn denote global indices of e such that the 
global index i corresponds to local index n. Then the contribution from element e can be written as, 

— R e K e x — R e K e x — R e K e x — R e K e x 


— —R e (K e x A- K e x A-K e x A- K e x ) 

±c nn^wn ' -* v nm^Rm ' - £V no ^ UO ' np^RpJ > 

which saves us a few matrix multiplications, and we then finally have 


f ui ^ 'j R (R nn x un T R nm x um Rno x uo + R 


np X up ) 


(10.115) 


(10.116) 
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for each element e do 
for each node i of e do 
tmp = 0 

for each node j of e do 
tmp += Ke_ij * xo_j 
next j 

f0'_i -= Re*tmp 
next i 
next e 


Figure 10.11: Pseudocode for calculating f u . 


for each element e do 
for each node i of e do 
for each node j of e do 
if i >= j then 

tmp = Re Ke_ij Re^{T} 
K'_ij += tmp 
if j > i then 

K'_ji += trans(tmp) 
end if 
end if 
next j 
next i 
next e 


Figure 10.12: Pseudocode for assembly process for the warped stiffness matrix. 


Assuming that f un is initially cleared to zero for all n. This results in the implementation strategy shown 
in Figure 10.11. Also the stiffness matrix can be optimized slightly by exploiting the symmetry property. 
The symmetry indicates that it is sufficient to only compute the upper triangular and diagonal parts. The 
lower triangular parts can be found simply be taking the transpose of the upper triangular parts. Thus, the 
e’th tetrahedral element contributes with 

K^+= R e K e nm R eT , (10.117a) 

K' p += R e K e mn R eT = (. R e K e mn R eT ) T , (10.117b) 

where i ~ n and j ~ m, and since K e mn = (K ( nm ) r . Assuming that all K\ rj is initially cleared to zero 
this results in the implementation strategy shown in Figure 10.12. 

Note that if R e is initially set to the identity matrix, then the stiffness warping reduces to the tradi¬ 
tional assembly of stiffness matrix (as it is used in linear elastostatics). Also notice that whenever R' 
changes, then the stiffness matrix assembly must be recalculated. This is done after each position update. 
Furthermore, if the i’th node is set to be fixed, this corresponds to letting K' lt = 1, and K'^ and K' p 
be zero for all j not equal to i. This is known as a Dirichlet boundary condition. However, we do not 


■e 
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use this during the assembly. Instead we assemble the K' matrix as though there were no fixed nodes. 
Later, when we use the K' matrix in computations such as K'x, we simply test whether Xj is fixed. If 
this is the case, then it is multiplied by the j’th column of K', i.e., K[ i:j . If so, we simply do nothing! 
This is computationally more tractable, and it also allow us to more easily turn nodes fixed and unfixed 
dynamically during animation. 

10.3.2 Orientation Computation 

In order to perform the stiffness warping assembly described in the previous section, we need to be able 
to estimate the rotational warp of each individual tetrahedron. The most elegant method is to find the 
transform using barycentric coordinates [Muller et al., 2004a], and this method will be described below. 

Let the deformed corners be xq, X\, a; 2 , and *3 and the undeformed corners x u0 , x u |, x u2 , and x u3 . 
Examining point x u = [x, y, z, 1] T inside the undeformed tetrahedron is written as, 


x 


U 


®«0 

1111 




w 1 
VJ2 


= Pw. 


w 3 


(10.118) 


The corresponding point in the deformed tetrahedron has the same barycentric coordinates, which mean. 


Xq X! X 2 X 3 
1111 


Wq 


W 1 
VJ2 


= Xw. 


W 3 


(10.119) 


We can now use (10.118) to solve for w and insert the solution into (10.119) to get, 

x = XP~ 1 x u . (10.120) 

The matrix XP 1 transforms x u into x. Since both P and X have their fourth rows equal to 1, it can be 
shown that their product has the block structure 


XP 1 


G a 
0 T 1 ’ 


( 10 . 121 ) 


and this is recognized as a transformation matrix of homogeneous coordinates. The a vector gives the 
translation, the G matrix includes, scaling, shearing, and rotation. 

We thus need to extract the rotational matrix R from G. In [Muller et al., 2004a] the authors suggest 
using Polar Decompositions [Shoemake et al., 1992, Etzmuss et al., 2003]. However, a simpler although 
more imprecise approach would simply be to apply a Gram-Schmidt orthonormalization to transform 
G into an orthonormal matrix. This seems to work quite well in practice. We have not observed any 
noteworthy visual difference in using polar decomposition or orthonormalization. 
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Since we arc only interested in the G part of (10.121), we can compute this more efficiently exploiting 
the fact that barycentric coordinates sum up to one. If we substitute wo = 1 — w\ — w 2 — w 3 in (10.118) 
and (10.119), then we can ignore the fourth rows, since they simply state 1 = 1, i.e., 


1 — W\ — W 2 — U> 3 


— [Xuo 


•Bul 


( 10 . 122 ) 


which is 


— Xu 0 (®ul X u q) (®u 2 ®uo) (®u 3 ®uo)] 


(10.123) 


Moving the first column to the left-hand side we get, 


X U Q — (®ul Xuo) (Xu 2 X u q) {Xu 3 X u q) ^2 


(10.124) 


Introducing e w = x ul - x u0 , e 20 = x u2 - x u0 , e 30 = x u3 - x u0 , and E = [e w , e 20 , e 30 ], we have, 


(10.125) 


and similar for (10.119) we have, 


x u - x u0 = E \w 2 , 


W\ 

X — Xq = E' w 2 


(10.126) 


where E' = [e^e^e^o] and e' w = x\ — xq, e' 20 = x 2 — xq, e 30 = x 3 — x$. Now inverting (10.125) 
and inserting the result into (10.126) yields, 


x x 3 — EE (x u x u q)- 


(10.127) 


By comparison with (10.121) we see that 


G = E'E- 


(10.128) 


Using Cramer’s rule the inverse of E can be written as 


l (e 20 x e 30 ) 
E 1 = tttt (e 3 o x ei 0 ) T 
6V L(eio x e 20 ) T 


(10.129) 
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This can easily be confirmed by straightforward computation 

E~ 1 E = 1. 


(10.130a) 


Using the notation 


we write. 


and we finally have. 


1 


n i = 

= 6U 

e20 

X 

£30 


1 




n 2 = 

“ 6U 

^30 

X 

eto 


1 




«-3 = 

= W 

eto 

X 

e20 


E - 1 = 


Tin 


T 


G = [e\ 


to 


-20 


-30 J 


T 


n 

7 

ns 


n 


3 J 


(10.131a) 

(10.131b) 

(10.131c) 

(10.132) 

(10.133) 


Observe that E' is very inexpensive to compute and all nonprimed quantities {ni,ri 2 ,n^} can be pre¬ 
computed and stored on a per tetrahedron basis given sufficiently available memory. Even in case where 
memory is not available ni, n 2 , and n : > are quite cheap to compute. 


10.3.3 Plasticity 

The total strain of the e’th element is given by 


etotal = Bu e (10.134) 

Where u e is the nodal displacement, i.e., u e = x — x„. Applying the idea of stiffness warping we have 

e to tai = B{(R e y 1 x - x u ) (10.135) 

where R e is the rotational deformation of the e'th tetrahedral element. 

The elastic strain is given as the total strain minus the plastic strain 

£ — r: t, 11 a 1 £piastic (10.136) 

Aplastic is initial initialized to zero. During simulation the plastic strain is constantly updates as follows. If 
elastic strain exceeds c y i e id, then it is added to plastic strain by c cree p 


if I|\vec e||_2 > c_yield then 

e_plastic += dt*std::min(T->m_creep,(1.0/dt)) e 


£■ 
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If plastic strain exceeds c max , then it is clamped to maximum magnitude 

if I|e_plastic|I > c_max then 

e_plastic = (e_plastic/|e_plastic|) c_max 

The plastic strain causes plastic forces in the material 


/plastic = R e K e Up h 


Using e p i astic = Bu ( , []c we find. 


/plastic — R K B epi as ti c 

/plastic = R e {V e B T D B)B~ l e plast i c 
/plastic = R e iV e B T D) e plastic 

Introducing the plasticity matrix P e = ( V f B l E) we have, 

/plastic = R P e p iastic- 


(10.137) 

(10.138a) 

(10.138b) 

(10.138c) 

(10.139) 


The plastic forces can be computed efficiently by exploiting the structure of the B n and D matrices as, 


Bn = 


bn 0 0 

0 C n 0 

0 0 d n 

Cn b n 0 

d n 0 b n 
_ 0 d n c n , 

' D 0 Di Dx 0 0 0 

TO Dq Dq 0 0 0 

D\ D\ Dq 0 0 0 

0 0 0 D 2 0 0 0 

0 0 0 0 D 2 0 

0 0 0 0 0 d 2 


(10.140a) 


(10.140b) 


This implies that the product B D is 

bn //(i b n I) i b r! L) | c n D 2 d n D 2 0 

B n D = c n D\ c n Do c n D\ b n D 2 0 d n D 2 

d n D 1 dn I)\ (],, I){) 0 b n D 2 c n D 2 

Notice that even though this is a 3 x 6 matrix with 18 — 3 = 15 nonzero elements it actually 
9 different values. In fact, these values could be precomputed and stored in each tetrahedron, 
they could be premultiplied by the volume of the tetrahedron to yield the matrix, 

P e n = V e B T n D. 


(10.141) 


only contains 
Furthermore 


(10.142) 
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The plastic force that should be subtracted from node n is then computed in each iteration as 

f n = R e P e n e plastic . (10.143) 

Instead of allocating another set of temporary variables to hold the plastic forces, it is much easier to 
simply add the plastic forces to the external force vector. Thus saving memory. 

10.4 Time Integration 

The equation of motion has the following form (where u = x — x u , and x is derivative w.r.t. time) 

Mx + Cx + K(x — x u ) = / ext . (10.144) 

|_ i 

Implicit discretization, means we evaluated x and v = x at time i + 1. Furthermore, using x = - A ~ v 
and x = v l+1 , we have 


M -—-+ CV +1 + K(x i+1 - x u ) = / ext , (10.145) 

and since x l+1 = x l + v 1+1 At, 

M -—-h Cv l+l + K{{x l + v l+l At) - x u ) = / ext , (10.146a) 

i~\~ 1 i 

M V + Cv i+1 + Kx l + A tKv i+1 - Kx u = / ext , (10.146b) 

Mv i+1 - Mv l + A tCv i+1 + A t 2 Kv i+1 = A/(/ ext - Kx l + Kx u ), (10.146c) 

(M + AtC + At 2 K)v i+l = Mv i + A/(/ ext - Kx 1 + Kx u ). (10.146d) 

Let f u = —Kx u , then 

(M + A/C + A t 2 K)v i+l = Mv i - At(Kx l + f u - / ext ), (10.147) 

so we need to solve Av l+l = b for -i/ +1 , where 

A = (M + AtC + At 2 K), (10.148a) 

b = Mv i - At(Kx i + f u - / ext ). (10.148b) 

This is termed a velocity update. The matrix A and the vector b are most efficiently assembled in a manner 
similar to the stiffness matrix assembly discussed in Section 10.2.5. That means that each node stores the 
corresponding row and right-hand side of Av = b. 
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Algorithm time-step 
if(stiffness_warp) 
update-orientation 
else 

reset-orientation 
stiffness-assembly 
add-plasticity-force 
dynamics assembly (A,b) 
conjugate_gradient(A,v,b); 
position_update 
End algorithm 


Figure 10.13: A time-stepping algorithm for the finite element method. 

The matrix A is a symmetric matrix, which implies that we can solve Av l+l = b efficiently by using 
a conjugate gradient solver. In [Muller et al., 2004a] 20 iterations were used. 

After the velocity update we perform a position update 

x i+1 = x i + v i+1 A t. (10.149) 

Notice that a fully implicit scheme requires K 1 1 . however for linear elastic materials K is constant. The 
time-stepping algorithm is summarized in Figure 10.13. In practice a time-step the order of At = 0.01 
seconds can be used without any problems. 

The damping matrix C can be computed in various ways. Often a linear combination of the mass and 
stiffness matrix, known as Rayleigh damping, is used 

C = aM + ftK (10.150) 

where a and ft is known as the mass damping and stiffness damping coefficients respectively. In practice 
we often only apply mass damping with a = 0.2 and no stiffness damping ft = 0. This has the compu¬ 
tational benefit that if M is a lumped mass matrix, then C becomes a diagonal matrix. Furthermore one 
does not even need storage for C, since it is given implicitly by M. 

10.5 Mesh Coupling 

Due to the complexity of the dynamics of finite element methods, it is often convenient to separate the 
visual geometry from the geometry used to compute the dynamics. This makes it possible to use highly 
detailed visual representation of objects, while using a computationally low-cost coarse volume mesh for 
computing the dynamics. 

A technique for doing this is called mesh coupling or cartoon meshing. Below we describe how it is 
used together with tetrahedral meshes. It is, however, a general approach and can be used with other types 
of geometries. Free-form deformation lattices as discussed in Section 8.4.7 arc another very common 
example of mesh coupling. 
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The first step in mesh coupling is to bind the vertices of the surface mesh to the tetrahedral elements 
of the volume mesh. A spatial hashing algorithm is easily used to find vertex-tetrahedron pairs, where 
the vertex is embedded inside the tetrahedron. The actual test is done by first computing the barycentric 
coordinates of the vertex with respect to a tetrahedron in question. The vertex is embedded in the tetra¬ 
hedron, when 0 < w t < 1. In practice, we need to apply threshold testing to counter numerical precision 
problems, and as a consequence it may happen that vertices lying close to a face of a tetrahedron gets 
reported twice: once for the tetrahedron embedding it and once for the neighboring tetrahedron. There¬ 
fore, a quick rejection test can be performed: if the vertex is already embedded in a tetrahedron, then it is 
simply ignored. 

Before rendering each frame, we must update the vertex positions to reflect the underlying deformation 
of the tetrahedral mesh. This is done using the barycentric coordinates, such that the new vertex position 
is given by, 

c = WQX o + w\Xi + w 2 x 2 + W 3 X 3 , (10.151) 

where xq, x\, x 2 , and X 3 arc the nodal coordinates of the tetrahedron, which the vertex was bounded to. 

If stiffness warping is used, the element rotation, R e , can be used to update the undeformed vertex 
normal, no, into the deformed vertex normal, n, by, 


n = R e riQ (10.152) 

Often a tetrahedra mesh is used with a conservative coverage of the surface mesh. That means that one 
is guaranteed that all vertices of the surface mesh arc embedded inside one unique tetrahedron. However, 
mesh coupling can be used in cases where only one have partial coverage. The solution is to bind a 
vertex to the closest tetrahedron. Even though the vertex lies outside the tetrahedral mesh, the barycentric 
coordinates extend the deformation of the tetrahedra mesh beyond its surface. 


10.6 Finite Element Method in the Literature 

Finite element method has been used to model hand grasping a ball [Gourret et al., 1989] and muscle 
simulation [Chen et al., 1992], An implicit finite element method was used in [Hirota et al., 2001] to 
simulate elastic solids in contact. They used their method for simulating the muscles in a bending knee. In 
[O’Brien et al., 1999] the authors used static isotropic elastic finite element method with additive plasticity 
to simulate crack initiation and propagation in three dimensional volumes, i.e., brittle fracture. Later 
in [O'Brien et al., 2002a] both brittle and ductile fracture were simulated. [O'Brien et al., 2002b] used 
dynamic isotropic elastic finite element method to extract modal vibration modes of rigid bodies. The 
vibration modes where then used to synthesizing sound in rigid body simulations. 

In [Teran et al., 2003] the finite volume method was introduced to the graphics community, and it 
was shown to be identical to the classical FEM when using linear basis functions and constant strain 
tetrahedra. The finite volume method has the great advantage that it easily supports different constitutive 
models. Furthermore, it does not require the assembly process to compute elastic and damping forces. 

In [Irving et al., 2004] they extended the finite volume method with a diagonalization method capable 
of handling inverted tetrahedral elements. They also introduced multiplicative plastic flow and a method 
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for plastic control to achieve prescribed deformations to aid animation control by animators. A rigid-body 
collision law and improved Newmark time-stepping method were also described. 

In [Irving et al., 2005] the finite volume method was generalized for other types of elements than 
tetrahedra, but it was indicated that tetrahedra were the most cost-efficient element for computer animation 
purposes. In [Teran et al., 2005] a framework is presented for simulating skeletal muscles with the finite 
volume method using a transversely isotropic, quasi-incompressible constitutive model that incorporated 
muscle fiber fields. Inverted elements were handled using diagonalization. 

To overcome the computational disadvantage of linear, isotropic, elastic finite element method, many 
have applied modal analysis, see [Pentland et al., 1989]. Later [James et al., 2002] moved the ideas onto 
GPUs by a method called Dynamic Response Textures. [Hauser et al., 2003] introduced the idea of using 
model modes from dynamic linear elastic finite element model to achieve interactive deformations. Later 
[Choi et al., 2005] extended the ideas to include rotational warps in order to overcome the volume growth 
in large deformations. 

[Muller et al., 2004b] used the finite element method based on cubical elements of uniform size to 
simulate objects represented by surface meshes. They used a mesh coupling idea and added a fracture 
technique. [Muller et al., 2004a] improved on the original ideas in [Muller et al., 2002], where a rotational 
warp was applied to the elastic forces in the stiffness matrix assembly. In the later work they included 
mesh-coupling, plasticity, and fracturing. Furthermore, they rectified the problem of ghost-forces by 
computing the rotational warp on elements instead of on a per node basis. 
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The animation of fluid phenomena is widely used in the film industry and of growing importance in the 
gaming industry. The physics of fluid dynamics has been successfully described by the Navier-Stokes 
equation since about 1821, although later investigation into turbulent phenomena has added considerable 
knowledge about the behavior of fluids at high velocities, etc. The Navier-Stokes model is usually used to 
predict the motion of fluid particles or volumes inside a fluid, but the model is also a good description of the 
behavior of gaseous phenomena such as smoke at speeds less than the speed of sound. For animation, the 
interface between water and air is equally important, since water waves visually dominate water scenes. 
For animating water waves, there arc two competing models, one developed by Gerstner in 1809 and an 
adaptation of Navier-Stokes to shallow water situations. A detailed review of the origins of water wave 
theory is given in [Craik, 2004], This chapter starts by describing the model of water waves leading to 
Gerstner’s model, followed by a description of the Navier-Stokes equations and their applications and 
implementations. 

11.1 Waves 

The most prominent visual feature of water is its surface. In the following, we will discuss equations for 
waves with emphasis on Gerstner’s theory. 

11.1.1 Cosine Waves 

The simplest model of water waves is sinusoidal, meaning that the profile of a wave at a fixed time is 
well described by a sum of sine and cosine functions. Therefore, waves are typically described by the 
parameters amplitude and frequency using a sum of cosine basic functions, 

= o 5 (A(^L±M1), (11.1) 

where x is the spatial coordinate, A is the amplitude of the wave, 0 is its phase, and L is its wavelength. 
An example of a circular wave is shown in Figure 11.1. Furthermore, simple waves interact by addition. 
Waves propagate through space 

, . . ,. , f 2ir(x + 6{x) — Ct) \ 

f a,l,c( x i t) = A(x,t) cos i -—-J , (11.2) 

where t is the time parameter and C is the wave propagation speed. 

From an artistic point of view, the pure sinusoidal wave functions in (11.2) tend to be too smooth at the 
top; it has been suggested by [Finch et ah, 2004] simply to raise the cosine to some sufficiently pleasing 
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Figure 11.1: The snapshot of a circular water wave modeled as a radially symmetric cosine function. 


power as follows: 


f(x,t) = A(x,t ) 




(11.3) 


and normalize the cosine function to span the range from -1 to 1. An example of the result for k = 2.5 is 
shown in Figure 11.2, and as shown, a consequence of the power function is that the waves also become 
flatter at the bottom. 


11.1.2 Gravity Waves 

Gravity waves are the most dominating waves in typical ocean scenes. They arise from the differential 
effect across the water-air interface. A simple model that may be used for small amplitude waves is 
Airy’s Model [Airy, 1845] (George Biddell Airy 1801-1892), where the velocity of the wave propagation 
is proportional to the water height as 


C(x ) 



2nh(x) \ 


(11.4) 


with h being water height and g = 9.82being the gravitational acceleration. As illustrated in Fig¬ 
ure 11.3, tanh(x) —► 1 for x —> oo and tanh(x) —> x as x —» 0. This implies that 


C{x) 



h —> oo, 
h —> 0, 


(11.5) 
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Figure 11.2: Cosine waves may be made sharper at their peak by raising the cosine function to some 
power k, here k = 2.5. 



Figure 11.3: The function tanh is asymptotically 1 at infinity and linear at 0. 


and as a consequence, waves will tend to move perpendicular to shorelines as illustrated in Figure 11.4. 
This is because for a single wave front the parts of the front that arc close to the beach will move slower 
than parts of the back, which arc further away. 

As a wave moves forward, the water particles for an ideal wave typically move in circular paths 
as illustrated in Figure 11.5. An accurate model for a particle in a single wave is the Gerstner Model 
[Gerstner, 1802] (Franz Joseph von Gerstner 1756-1832). Since individual waves move locally in a par¬ 
ticular direction, we may accurately describe the motion of a single water molecule as 


x(t) 


rx 0 A exp ( 2 l z °) sin f 27r(x ° L ~ Ct) ) j 

_z{t)_ 


z 0 + Aexp(^)cx J s( 2n(x °- Ct) ) 


where [xq, zq] t is the position of the particle relative to the surface at some initial reference time, A is 
the radius of the circular path, L is the wavelength, and C is the particle’s speed. At the surface, zq = 0, 
hence 


'x(t) 


[*o Asin( 2 ^° L - Ct) )l 

z(t)_ 


A cos 


(11.7) 
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Figure 11.4: Waves approaching a shore will tend to become parallel with the shoreline 

Particle 



Figure 11.5: For ideal waves, the water particles move in circles, when a wave propagates forward. 


and the particle velocity is in a global coordinate system 


V x (t) 


\ 2 ^ A cos( 2 ^T Ct) )l 


[cos(- ( T Ct) )l 

y z (t)_ 


2^tCA ( 2ir(xo-Cit) \ 

L ° m \ L ) 

— Vo 

sin 


Moving to a local coordinate system 

2tt(xq — Ct ) 

° = I 

implies that 


x(a) 


\x 0 ^sin( 27r(a;o L - Ct) )l 

z(a ) 


A cos 


%r- + Ct — A sin a 


A cos a 


( 11 . 8 ) 


(11-9) 


(11.10a) 

(11.10b) 


Investigating a snapshot by setting t 
xq gives 


0, and analyzing the profile for varying a, which is proportional to 

x(a) 
z(a) 


^ - Asm a 
A cos a 


( 11 . 11 ) 
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Figure 11.6: A trochoid curve is generated by rolling a circle along a line and noting the points swept by 
a line along the radius. If the line is not longer than the radius of the circle, then the curve is a function. 

This is a trochoid with A as the radius of a rolling circle, and ^ is the distance from center to a fixpoint 
being traced on the rolling circle. Examples of trochoids are shown in Figure 11.6, where the top has 
Tjr < .4. middle has ^ = A, and the bottom has -^ > A, which is no longer a model for water waves. 
To generate a wave profile, several water particles are tracked simultaneously, and the profile is generated 
by joining neighboring particles with a line. A procedure for generating Gerstner waves in one dimension 
is given in Figure 11.7. In Figure 11.8 is the output of this algorithm. The figure shows the wave profile 
at a fixed time, together with the trace of a water particle. As can be seen, the water particle is moving 
in an elliptical fashion. The ellipse is a consequence of the nonunit aspect ratio of the graph, since the 
movement is truly circular according to (11.11). Further on a horizontal line in the figure we see the 
project of the x-coordinate. As we can see, the density along the x-axis is highest at the wave top, which 
is convenient, since this is where the curvature of the wave is highest and where there is therefore a need 
for finer sampling in order to preserve the wave profile. 

The ID Gerstner wave is extended to two dimensions in two steps: first, by extending (11.7) with the 
y-coordinate, and then rotating the result around [xq, yo T , 

[xol -A sin ( M^-et) ) 

y(t) = y 0 + R z 0 , (11-12) 

_*(*)J L 0 J [ A cos ( K 2 <^~ ct ) ) 

where R z is the specialized rotation matrix, 

cos 9 sin 9 0 

R z = — sin 9 cos 9 0 , 

0 0 1 


(11.13) 
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algorithm gerstner() 

N = 10 
xO = 1: N 

x = zeros(l,N) z = zeros(1,N) 

A = 0.5 
C = 1 
L = 10 

for t = 1 to N step 0.1 
for i = 1 to N 

x(i) = x0(i) -A*sin(2*pi*(xO(i)-C*t)/L) 
z(i) = A*cos(2*pi*(xO(i)-C*t)/L) 
end 

plot(x,z) 
end 


Figure 11.7: Pseudocode for generating propagating Gerstner waves in ID. 


Gerstner Wave 



Figure 11.8: The Gerstner wave and a circulating particle. 


which rotates a three dimensional point 8 degrees around the 2 -axis. 

Finally, it may be desired to limit the spatial extend of the wave along the y-axis by multiplying with 
a window. 


x{t) 


,T 0 


w{y(t),a)Asm( ( ° 

y(t ) 

— 

yo 

+ Hz 

0 

z{t)_ 


0 


[_ u>(y{t), a) A cos ( ( ° L J 


where a specifies the width of the window. An example of a window function is the unnormalized Gaus¬ 
sian function 


G(x , a) = exp 



which contains 76% of its mass in the interval [—< 7 , <r\. 


(11.15) 
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A consequence of the circular paths of the particles in an ideal wave (11.11) is that there is no net 
transport of water except near the shoreline. In deep water, the particle speed is found as the length of the 
particle velocity (11.8), 


Q 


V x {t) 

v z (t ) 


= Vo = 


2nCA 

L 


(11.16) 


Hence, if the particle speed exceeds the wave propagation speed, then the particles escape the wave and 
the waves break, which typically results in white foam on the top of the waves. That is, wave breaking 
occurs when 


Q > C, 


(11.17) 


which implies that 


2nCA 

L 


>C, 

4 


2ir > 


L 

A' 


(11.18a) 


(11.18b) 


Hence, the breaking of a wave is independent of the wave speed, but only depends on the ration of the 
wavelength L and the wave’s amplitude A. Wave breaking is typically visualized by emitting white 
particles from the top of the wave or by using texture maps to visualize foam [Jensen, 2001]. 

Waves interact by addition. Consider the example of a particle originally at [ x ' q , yo, 0] being moved 
according to a number of unclamped Gerstner waves. Adding the waves gives 


x(t) 


Xo 


y(t) 

= 

yo 

+ Rz 

z{t)_ 


0 



-A sin ^o-Gt) 
0 

A cos ( Mzo-ct) 


+ R' 


-A / sin( 2 ff(!E °T C,f ) 

0 

A' cos 


Xo 

yo 

0 


Xo 

yo 

0 


+ 


+ 


— cos(0)Asin ( Mso-ct) ) _ cos(0 /)^ sin -C't) 
sin(0)A sin ( 2 ^ >-Ct ) ) + sin ^ A > sin *o-C't) ' 

Hcos(^p^) + A! cos + .. 

-S 1 c OS (0,)yt i sm( 2j| X C ‘‘ l 
E i sin(9 i )X i sm( 2 'A7 C “ ) ) 


+ 


ZiAi cos ( Mxo ~ Cit) 


(11.19) 


( 11 . 20 ) 


( 11 . 21 ) 


However, it is not simple to determine the breaking point of the resulting waves, since it requires that the 
resulting velocities of the wave and the particle arc calculated. Wave breaking is therefore often left for 
an animating artist. 

In the literature on partial differential equations. Burger’s equation is often introduced for its simple 
but nonlinear characteristics [Collatz, 1986, Morton et al., 1994]. Burger’s equation models the breaking 
of waves, but it has not yet found practical value in computer animation. 
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Figure 11.9: The substantive or total derivative estimate the time derivative as a volume is moved in a 
velocity field. 


11.2 Fluid Motion 

In this section we will introduce the general subject of fluid mechanics [Tritton, 1988]. A recent survey 
on the origins of fluid mechanics is given in [Craik, 2004]. 

The most important physical properties of fluids are velocity u, density p, pressure p, temperature T, 
and shear viscosity rj and the kinematic viscosity p, which arc macroscopic averages. It is often useful to 
check physical equations according to their units, and the International System (SZ)standard units for the 
above properties arc [u] = ms -1 , [p] = kg m 3 , [p] = N m 2 , [T] = K, [//] = Pas, and [p] = For 
our purpose, they only make sense within some interval of scale much larger than mean free path of the 
molecule and much smaller than the scale of the total system. The selected scale will be considered the 
unit scale in the rest of this chapter. 

A major problem in fluid dynamics is deciding between a coordinate system fixed in space or on the 
particle, also known as the Eulerian and Lagrangian coordinate systems respectively. Although appar¬ 
ently trivial, some of the properties of a fluid such as velocity and acceleration are best described in the 
Lagrangian system, while density, pressure, and temperature arc better described in the Eulerian system. 
In this chapter, we will first discuss the Navier-Stokes equations in the Eulerian coordinate systems along 
the lines of [Tritton, 1988], followed by a discussion of smooth particle hydrodynamics, which is a particle 
approach using the Lagrangian coordinate system. 


11.2.1 Derivative Operators for Vector Fields 

Fluid mechanics heavily relies on the differential notation derived in Chapter 18.3 and summarized in 
Table 26.2. Furthermore, our choice of the Eulerian coordinate system makes the identification of a single 
particle impossible, and applying Newton’s second law on a fixed position would not be correct, since 
different particles occupy a fixed volume at different times. However, the fluid flow gives an indication 
of the average motion of particles per unit volume. As an example, consider the temperature change by 
time as a volume is tracked in a velocity field, where the general situation is depicted in Figure 11.9. 
Assume that we are tracking a particle with coordinates [x(t),y(t), z(t )] in three dimensions, that the time 
varying temperature field is given as T : M 4 — M, and that the temperature of the particle is determined 
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as T(x(t),y(t ), z(t). t). Then the total derivative with respect to t is found to be 


dT 

dt 


8T 8T dx dT dy dT dz 
dt dx dt dy dt dz dt 
dT dT dT dT 
dt + te U+ dj V+ d; W ’ 


(11.22a) 

(11.22b) 


using u = [u, v , w} T as the velocity vector of the particle. This is the change of temperature in the fluid, 
following the fluid, also called the substantial derivative. The substantial operator for a flow u and in a 
dimensionless formulation is given as 


d_ 

dt 


d 

m +u ' v ' 


(11.23) 


where V = V \ x . y jri' is the spatial gradient operator, which will be assumed in the rest of this chapter. 
Newton’s second law of motion in terms of the substantial derivative per unit volume is found to be 


fl Jt u = F 

s t 

d \ 

p[ di + u ' v ) u = F ’ 


(11.24a) 


(11.24b) 


where F is the sum forces acting on the volume. 


11.2.2 Euler’s Equations 

Euler (Leonhard Euler 1707-1783) was the first to formulate an equation of the motion of fluids using 
Newton’s second equation, and the pressure gradient as an internal force [Euler, 1755], augmented with 
an equation for mass conservation under the assumption of incompressible fluids 


d_ 

dt 


+ u ■ V u = —VP, 


V • u = 0 


(11.25a) 

(11.25b) 


and these equations are commonly known as Eider’s equations. 

The mass conservation equation (11.25b) is a consequence of Gauss’ divergence theorem (18.142): 
consider a closed region of a fluid V as shown in Figure 11.10. The total mass in this volume is given as 

mass of V= I p dV, (11.26) 

Jv 

where dV is a unit volume element also known as a three form. The scalar p is the fluid density, calculated 
as the average molecular mass per unit of volume. As the fluid flows, mass will enter and exit this region 
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Figure 11.10: A volume V completely inside the fluid. 


across its surface S. Use dS to denote an area element of the border also known as a two-form, and use n 
as the outward normal to the border. The amount of mass loss may now be calculated as, 


Loss of mass from V = / pu ■ n dS. 


(11.27) 


Setting the time derivative of (11.26) equal to minus (11.27) we get 


d 


d 


— pdV = —P dV = - / pu-ndS. 


dt 


'V 


iv 


dt 


(11.28) 


Letting V shrink until the change in density is constant inside V, the integration in the middle equation is 
redundant, 

r\ r\ 

pdV = --pV (11.29) 


lim 


X 


v^ojy dt dt 

implying that the same limit for the r.h.s. of (11.28) is given as 

d If 

—p = — lim — / pu ■ n dS = —di vpu = —V • (pu), 


dt 


v^o V J s 


(11.30) 


by the definition of the divergence operator, div, and where V = V \ x ,y,z ] T ' s th e spatial gradient operator. 
This is the conservation of mass equation also known as the continuity equation. Often the terms arc 
rearranged as 

^- t P + V-(pu) = 0. (11.31) 

When the density is constant, this implies that 


pV • u = 0, 
V • u = 0. 


(11.32a) 

(11.32b) 


and the fluid is called an incompressible fluid. 
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11.2.3 The Naiver-Stokes Equations 

Euler’s equation ignores the friction between water molecules, and Navier (Claude Louis Marie Henri 
Navier 1785-1836) was the first to derive the equations including friction from a pure theoretical consid¬ 
eration [Navier, 1822], 

p {§t + U ' V ) U = ~ VP + pX/2u + FeXtema1 ’ (11.33a) 

V-u = 0 (11.33b) 

where E exlenial arc the external forces such as gravity and the motion of solid objects in the fluid. Navier 
determined the kinematic viscosity , p, as a function of molecular spacing, but did not attach any phys¬ 
ical significance to it. Stokes (George Gabriel Stokes 1819-1903) later derived the same equations 
[Stokes, 1845], where he made the physical meaning of // clear as the magnitude of the fluid’s viscos¬ 
ity. The kinematic viscosity is related to the shear viscosity as /t = f These equations are therefore now 
known as Navier-Stokes equations. If II Vp|| 3> ||pV 2 rr|| , then the fluid is called a viscous fluid, and in 

2 II 112 

the opposite case, where || Vp|| <C ||//V 2 'u,I , then the fluid is an inviscid fluid. 

2 II 112 

A finite difference solution to the Navier-Stokes equations are found using the Helmholtz-Hodge de¬ 
composition [Harris, 2004], 

Theorem 11.1 (Helmholtz-Hodge decomposition) 

Given a Domain V, its border 5V, normals to the border n, and a scalar field p, then a vector field w can 
be decomposed into 

w = u + Vp, (11.34) 

where V • u = 0, and u ■ n = 0 

The proof is rather complicated, and is therefore omitted in this text; instead, see [Denaro, 2003]. The 
implications of Helmholtz-Hodge decomposition are that 

1. Given a general vector field w, we may then find a scalar field p by taking the divergence on both 
sides of (11.34), 

V • w = V • (u + Vp) (11.35a) 

= V • u + V 2 p (11.35b) 

= V 2 p. (11.35c) 

The equation V • w = V 2 p is a Poisson equation, and is solved straight forwardly with a central 
finite difference approximation. 

2. Given the vector field w and the corresponding scalar field p, we may now calculate a divergence- 
free vector field as 


u = w — Vp. 


(11.36) 
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Object Type 
Concrete 
Rough Wall 
Smooth Plastic 
Radiator 
Open Window 0 no 7). 


Description 
Small amount of turbulence 
Much turbulence 
Laminar flow 
Heat convection 
Heat is blowing into scene 


U\ u 2 T 

0 0 T 0 

—uq 0 To 

uq 0 To 

0 0 T k 


Table 11.1: Some useful border conditions [Foster et al., 1997]. 


Hence, given a solution to the Navier-Stokes equations at time t for u, p, and T exte " ial a numerical solution 
for t + At is obtained by first performing an explicit finite differencing step of (11.33a) to produce a 
divergence-full vector field w , then (11.33b) is solved in two steps: first by solving the Poisson equation 
(11.35c) for p, and then by calculating the divergence-free field u using (11.36). Examples of such 
implementations arc given in [Foster et al., 1997, Harris, 2004]. 

Boundary conditions arc typically handled in a pragmatic manner as follows: a fluid cannot pass 
through a impermeable rigid body, and the relative velocity of the fluid at that point must be zero, or 

u ■ n = U ■ n, (11.37) 

where n is the normal of the boundary and U is its velocity vector. Furthermore, there should be no slip 
between the wall and fluid right next to it, amounting to no relative tangential velocity 

u x n = U x n. (11.38) 

Combining (11.37) and (11.38) we must have 


u = U. (11.39) 

In [Foster et al., 1997], the border conditions in Table 11.1 have been found to be useful. 

A two-dimensional example of pseudocode implementing incompressible, viscous Navier-Stokes equa¬ 
tions is given in Figure 11.11. The gradient of the pressure term in the first Navier-Stokes step may actually 
be ignored, since it is effectively added by the solution to the Poisson equations. This is the so-called pro¬ 
jection method [Harris, 2004]. Nevertheless, we have chosen to keep it in the above code for pedagogical 
reasons. On a personal note, we find it appealing, that although the equations appeal - difficult, the actual 
implementation is rather short. 

11.2.4 Navier-Stokes for Dry Air 

For a gas, Boyle and Charles noted that the three thermodynamic variables pressure, p, molar volume, a, 
and temperature T depends on each other. The molar volume is related to the density, p, the number of 
molecules, n, and the volume, V, as o: = ~ For an ideal gas, the dependency is simply called the 

Ideal Gas Law, 

pV = nRT, (11.40) 
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algorithm navierStokes() 
dt = 1/15 
nu = 10 
h = 0.1 

while(i < T) 

// Perform a step in the Navier-Stokes direction 
du = -u*dx(u) - v*dy(u) + nu*(dxx(u) + dyy(u)) 
dv = -u*dx(v) - v*dy(v) + nu*(dxx(u) + dyy(u)) 
u = u + dt*du 
v = v + dt*dv 

[u,v,p] = setBoundaries(u,v,p) 

// Solve Poisson Equations 
gradDotU = (dx(u)+dy(v))/dTau 
while(max(abs(p-pOld)) > 10 A (-3) 
pOld = p 

p = p + h*(dxx(p) + dyy(p) - gradDotU) 
end 

// Adjust velocity field to become divergence free 
u = u - dx(p) 
v = v - dy(p) 

// Set boundary conditions appropriately 
[u,v,p] = setBoundaries(u,v,p) 

plotVectorField(u,v,p) 
i = i+di 
end while 
end algorithm 


Figure 11.11: Pseudocode for implementing incompressible viscous Navier-Stokes equations. The un¬ 
specified functions dx, dy, dxx, and dyy perform central finite differencing of first and second order, 
setBoundaries clamps values according to chosen boundary conditions, and plotVectorField 
presents the resulting vector field in some suitable graphical manner. 
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where R = 8.314—is the ideal gas constant. Ideal gases do not exist, but arc nevertheless a 
good approximation of gas behavior under normal conditions, which is attractive in computer graphics 
that usually arc concerned with animating everyday phenomena. Disregarding chemical, magnetic, and 
electrical effects, the first law of thermodynamics states that the rate of heat addition, is 


dq dE[ nt dW 
dt dt dt ’ 


(11.41) 


where is the internal energy and W is the work done by the gas on its surroundings due to expansion. 
For an ideal, inviscid gas, 


dEj nt dT 
dt v dt ’ 
dW da 

~dt ~ P ~dt' 


(11.42) 

(11.43) 


where c v is the specific heat coefficient for constant volume. There is also a specific heat coefficient for 
constant pressure, c p , and these arc related for a mono-atomic ideal gas as 


c v — R, 


(11.44) 


implying that [Haltiner et al., 1980], 

dq dp f R\ da dp c p da 

— = a— +14- p-r- = a— 4- p— 

dt dt \ c v ) dt dt c v dt 


(11.45) 


The equations (11.30), (11.33a), (11.40), and (11.43) constitute a complete system of equations describing 
the motion of dry air. 


11.2.5 Navier-Stokes with Heat Convection 

For a incompressible, viscous fluid, the Boussinesq approximation is often used [Tritton, 1988]. In the 
Boussinesq approximation, the only fluid property considered is the density, and it is only considered 
when the density variation gives rise to a gravitational force. Assuming that the density varies slightly 
from some reference density p$, for example, in a neighboring point, then the density in a particular point 
as 

p = p 0 + Ap (11.46) 

Since we arc ignoring nongravity effects, the continuity equation (11.30) is simply V • u = 0, and we 
calculate the change in moment as pojjju. However, gravity effects causes the third term in (11.33a) to 
become, 

F = pg = poQ + Apg (11.47) 

The gravitational acceleration can be rewritten as the gradient of a scalar field, g = —VC/, using G = gz 
with z being the direction up. Newton’s second law for fluids (11.33a) is then written as 

Po~jj; u = — V (jp + pqG) + pW 2 u + A pg = —VP + p\/ 2 u + A pg. (11.48) 
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with P = p + [>(](’■ Hence, if the density is constant, then (11.48) simplifies to (11.33a) except for 
the correction of the pressure term for the height. If the pressure does not appeal - directly in the boundary 
conditions, this correction of the hydrostatic pressure has no physical significance. When there is a density 
variation, then the last term will cause an effect when ^u <C g, and in this case, we may use a linearizing 
of the density variation 

A p = — ypoAT (11.49) 

with 7 as the coefficient of expansion of the fluid. Inserting (11.49) into (11.48) gives 


Po-jru =-VP + gV 2 u -'yp 0 ATg. (11.50) 

at 

The term pp^ATg is known as the buoyancy force. An additional equation for the temperature is needed, 
and in the Boussinesq approximation it is assumed that 

^-„c,±T-kr^T, <11.51, 


where kr is the thermal conductivity coefficient. See [Tritton, 1988] for arguments for using c p instead of 
c v . In a fixed coordinate system, the temperature field is thus governed by 


dT 

~di 


4Cv 2 r-v.(T U ) + 4_^. 

pCy clt 


(11.52) 


Equations (11.30), (11.50), and (11.51) are the basic computational fluid dynamical equations with con¬ 
vection. 


11.2.6 Navier-Stokes with Concentration Variations 

Analogous to (11.49), the Boussinesq approximation may be used to model concentration variations as 

Ap = "fpoAc (11.53) 

where c is the concentration. Newton’s second law then becomes 

po-^u =-VP + pV 2 u-'ypoAcg. (11.54) 

at 

Again, analogous to heat convection, the concentration is governed by diffusion. 


d 

pCp Jt° ~ 


k c V 2 c, 


(11.55) 


where k c is the thermal conductivity coefficient. 
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Figure 11.12: Snapshots from a shallow-water animation. 


11.2.7 Shallow Water 

So far, the discussion on Navier-Stokes equation have been for fluids fully enclosed in some solid volume. 
In contrast, the Gerstner equation discussed in Section 11.1.2 is mainly concentrating on the water sur¬ 
face. A connection is given through the shallow water equation [Saint-Venant, 1871, Flaltiner et al., 1980, 
Layton et al., 2002], and an example of a shallow-water computation is shown in Figure 11.12. The shal¬ 
low water equation is much more powerful than the Gerstner equation, since it is dynamic and allows for 
easy interaction between the fluid and boundaries. The shallow-water equation assumes an incompressible 
and inviscid fluid, and they will be derived in the following. 

Assume a column of fluid of height, h, and base area B. Then the total mass of fluid at the base over 
the area B is 

m = hBp , (11.56) 

which corresponds to a force on the base and in the direction of gravity as 

f = mg. (11.57) 

The pressure at the base is therefore 

p=^=hgp. (11.58) 

Hence, the pressure term in (11.33a) simplifies as 

Vp = gpVh. (11.59) 
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For large-scale motions of the fluid, the vertical acceleration may be ignored, and the fluid is said to be in 
hydrostatic equilibrium. This implies that Newton’s second law of motion simplifies to 

p T u • v) u = -gpVh + pV 2 u + F extemal , (11.60) 

where only changes in the horizontal plane arc nonzero, that is, the above equation reduces to the two, 


(du du du\ dh f d 2 U d 2 u\ _, externa | 

p {m +u d^ + v d^J = ~ 9 P fa +fi (d^ + w) + 1 ’ 

(dv dv dv\ dh (d 2 v d 2 v\ ^external 

p {dt +u fa +v d^J = ~ 9 % +p (d^ + w) + 2 ‘’ 


(11.61a) 


(11.61b) 


for u = [n, v. w] T . The mass preservation in terms of the height is found by first integrating (11.30) by z, 


du dv dw 

dx dy dz ’ 


(11.62a) 


[ h du dv dw , fdudv\ 

/ o—^ a — = a —^ 7T~ J h + Wh — wq = 0. 
/q ox oy uz \ ox Oy J 


(11.62b) 


Since wq is the vertical speed at the bottom, this term must vanish. On the other hand, vj /, is the rate of 
rise at the fluid surface, hence 


w 0 = 0, 


(11.63a) 


Wh = Tf 


Inserting into (11.62b) it is found that 


du dv \ _ dh _ dh dh dh 
dx + dy) dt dt ° dx V dy 


(11.63b) 


(11.64a) 


du dv\ , dh dh dh 

gi + W + at + tt te~”% =a 


(11.64b) 


The shallow water equations arc now obtained by using _F cxternal = [0,0, g\ T and assuming inviscid fluid, 
that is, that ||^V 2 m|| <C ||Vp|| , hence 


du du du dh 

~di +U d^ + V d^ = ~ 9 d^, 

dv dv dv dh 

m +u fa +v d^ = ~%’ 

dh dh dh f du dv\ , 

at +u di + v dt; = -{di + dti) 


(11.65a) 


(11.65b) 


(11.65c) 
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A simple finite difference using central approximations and zero flow at the borders may be used, how¬ 
ever the implicit semi-Lagrangian time integration scheme as implemented in [Layton et al., 2002] is 
superior in performance and stability and has been used to produce the animation shown in snapshots in 
Figure 11.12. 

The implicit semi-Lagrangian time implementation of the shallow water equations is based on the 
Eulerian formulation given above, where the physical parameters arc computed on the grid. However, the 
velocity field is approximated, as if it had been calculated by following individual particles, which yields 
a semi-Lagrangian formulation. That is, the substantial derivatives in (11.65) arc replaced with ordinary 
derivatives 


du 

dh 


dt 

9~n ) 
ox 


dv 

dh 


dt 

dy 


dh 

f du 

dv 

dt 

\dx 

dy 


(11.66a) 

(11.66b) 

(11.66c) 


and the ordinary derivatives arc estimated by tracking a hypothetical particle located at grid point (i. j) 
using forward differences. 


du 

dt 

dv 

dt 

dh 

dt 


u(t + A t) — u(t) 
A t 

v(t + At) — v(t) 
At 

h(t + At) — h(t) 
At 


+ O(At), 
+ 0(At), 
+ 0(At). 


(11.67 a) 
(11.67b) 
(11.67c) 


The functions u, v, and h arc only given implicitly in the Eulerian formalism and have to be estimated. 
Using the current values of u, v, and h at grid point (i, j), we assume that these arc a good approximation 
of their values in the previous step, and the departure point of the hypothetical particle at (i, j) is thus 


assumed to be 


x(t — At) 
y{t - At) 


x{t) 

y(t) 


At 


u{t) 

v{t) 


+ O(At). 


( 11 . 68 ) 


We may thus estimate the values of u, v, and h by bilinear interpolation, that is, 


u = 

= (1- 

a)(l - 

b) u{[x\, 

m 

+ (1 - a)b 

n(LxJ 




+ 

ab u( \x 

Uyl) + 

a(l - 

~ b) u(\x], 

m, 


(11.69a) 

V = 

= (1- 

o)(l - 

b) v{[x\, 

LyJ) 

+ (1 — a)b 

v([x\ 




+ 

ab v{\x 

i,r*n) + 

a(l - 

- b) v(\x~\, 

m 


(11.69b) 

h = 

= (1- 

a )(l — 

b) h([x\, 

m 

+ (1 - a)b 

h{[x\ 

Jyl) 



+ 

ab h(\x 

i,r*n) + 

a(l - 

~ b) h( 

m- 


(11.69c) 
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where a = x — [x J and b = y — [y\, and [• J and \•] arc the floor and ceiling operators respectively. 
Discretizing (11.66) in time implicitly and using (11.67) gives, 


u(t + At) — u(t) dh(t + At) 
— = -g- 


At 

v(t + At) — v(t) 


dx 

dh(t + At) 


At 9 dy 

h{t + At) — h(t) { du(t + At) dv{t + At) \ 

At \ dx dy ) 

To produce a single equation, we first calculate the derivative (11.70a) with respect to x, 


d u(t + At) — u(t) 
dx 


At 


d dh{t + At) 
dx9 dx 


4 




At 

assuming that g is independent on x. This implies that 

du(t + At) du{t) 


~ &tg 


dx 2 


d 2 h{t + At) 


dx dx a dx 2 

Likewise, we calculate the derivative of (11.70b) with respect to y to give, 


d v(t + At) — v(t) 
dy 


At 


dv(t-\-At) 

dy 


d dh(t + At) 
dy9 dy 


4 


which implies that 


At 

dv{t + At) dv(t) 


%y~ _ gd 2 h{t + At) 


- A tg 


dy 2 

d 2 h{t + At) 


dy dy ^ dy 2 

We can now eliminate u{t + At) and v(t + At) in (11.70c) using (11.72) and (11.74) as, 

hit + At) — hit) f du(t) d 2 h{t + At) dv(t) d 2 h{t + At) 
-At-- + ^-~ At3 - 


and when we rearrange the terms we find, 
h(t + At) = h(t) — 


A th(t) ( - At /' i( ( + At| + - At 9 


\ dx 


dx 2 


dy 


dy 2 

d 2 h(t + At) 
dy 2 


(11.70a) 
(11.70b) 

(11.70c) 

(11.71a) 

(11.71b) 

(11.72) 

(11.73a) 

(11.73b) 

(11.74) 

h(t), (11.75) 


(11.76) 
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Finally, a numerical scheme is obtained by using first- and second-order central differences for the spatial 
derivatives to give 


h(x, y,t + At) = h(x, y, t) 


du(x,y,t ) 


— Ath(x, y, t ) 


d 2 h(x , y,t + At) 

dx ' (te 2 

dv(x,y,t) i /a ^ 2 , ^„d 2 h(x,y,t + At) 


- Ath(x, y, t) A^2^1Zl + (A t) 2 h(x, y, t)g 


dy 


+ (A t) h(x,y,t)g- 


dy 2 


= h(x, y, t) 


— Ath(x, y, t) 


u(x + l,y,t) -u(x + l,y,t) 


(11.77) 


+ (A t) 2 h(x,y,t)g 


2Ax 

h(x + 1, y, t + At) — 2h(x, y,t + A/) + h(x — 1 ,y,t + At) 


Ax 2 


- Ath(x,y,t) 


v(x,y + l,t) - v(x,y - 1 ,t) 


2 Ay 


+ (At) 2 h(x, y, t)g 


h(x, y + l,t + At) — 2h(x, y,t + At) + /i(x, y — 1, t + At) 


Ay 2 


(11.78) 


This equation is nonlinear - since it contains products of /i’s. Nevertheless, terms involving h (t + At) may 
be isolated. 


h(x, y, t + At) - (A t) 2 h(x, y, t)g 


h(x + 1, y, t + At) — 2/i(x, y, t + At) + h(x — 1, y, t + At) 

Ax 2 


+ (A t) 2 h(x,y,t)g 


= h(x,y,t) - Ath(x,y,t) 
- Ath(x,y,t) 


h(x, y + 1, t + At) — 2h(x, y, t + At) + h(x, y — 1, t + At) 

Ay 2 

u(x + 1 , y, t) — ft(x + 1 , y, t) 

2Ax 

F(x,y + 1, t) - v(x,y- 1 , t) 


2Ay 


(11.79) 


and when assuming that h(x,y,t) is constant on the left-hand side the system may be solved as a linear 
tridiagonal system for h(t + At) based on hit). The entire algorithm is shown in Figure 11.13. Although 
the shallow water equations do not contain any diffusion terms, the waves will slowly diminish, since the 
velocities are smoothed implicitly by the bilinear interpolation and the time integration performed in the 
numerical scheme. 

The boundary conditions are important for this method. It is our experience that setting the derivative 
of h across the boundary to be zero, and setting u and v to be zero at the boundary, gives useful results. 
This implies that the height-field is relatively unaffected by the boundary, while the velocity outside the 
fluid is zero. More technically, the boundary condition on h is of the type von Neumann described in 
Section 9.3.7, while the boundary condition on u and v is of the type Dirichlet described in Section 9.3.8. 
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algorithm shallowWater(float h[][], int N, float T) 
u = zeros(N,N) 
v = zeros(N,N) 
dt = 0.1 
g=9.82 
t = 0 


while(t < T) 

t=t+dt // Perform a step in the Shallow-Water equation 

// Estimate departure points using (11.68) 

[xfrom,yfrom] = computeDeparturePoint(x,y) 

// Estimate values of u,v,h at departure points using (11.69) 

[hTilde, uTilde, vTilde] = EstimateDepartureState(h,u,v,xfrom,yfrom) 

// Solve for h(t+dt) using (11.79) 

h = ShallowWaterStep(h,u,v,g,hTilde,uTilde,vTilde) 

// Update u(t+dt) and v(t+dt) using (11.67) 

[u,v] = UpdateVelocities(h,u,v,g,uTilde,vTilde) 
end 

end algorithm 


Figure 11.13: Pseudocode for the shallow water equation. 


11.3 Smoothed Particle Hydrodynamics 


In the Eulerian view of fluid dynamics, the fluid is considered a density and it is therefore complicated to 
attribute notions of velocity and acceleration to it. In contrast, in the Lagrangian view, the fluid is consid¬ 
ered a collection of particles, for which velocity and acceleration are natural descriptors, but in contrast, 
parameters such as density, pressure, and temperature arc more difficult to define. In the following, we will 
describe smoothed particle hydrodynamics, which is a Lagrangian implementation of the Navier-Stokes 
equations, where kernels arc the central issues for estimating the parameters of density, pressure, and 
temperature. This text is based on [Shao et al., 2003, Muller et al., 2003, Roy, 1995, Monaghan, 1992, 
Monaghan, 1988], 

Consider a fluid represented as a collection of particles. Each particle represents a certain volume V t , 
and carries position, mass, and velocity. We will attribute each particle with other quantities such as fluid 
density p, = rrii/Vi, pressure p t , and temperature T t , however it does not make sense to talk of the density 
of a particle, since this is a macroscopic quantity. These macroscopic quantities should therefore only be 
obtained as weighted averages of the neighboring particles. Assuming the general case, we are given a 
scalar quantity Aj for each particle j. Then in any position r we may interpolate this scalar quantity using 
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Gaussian K(r ) 
B-Spline K(r) 

Q-spline K(r ) 


fc ex P 


- r 


T. < 


(2 

3 

(2— ||r 


6 


lo, 


1.5 



0< ||r|| 2 1, 

1 < ||r|| 2 < 2 

otherwise. 


i < 


f(3 

(3 

(3 

0 


r||J 5 -6(2-||r|| 2 ) 5 + 15(l- 

r|| 2 ) 5 -6(2-|M| 2 ) 5 , 

r ll 2 ) 5 ’ 



0 < IIr-II < 1, 

1 < \\r\\l<2, 

2<IMI 2 <3, 

otherwise. 


Table 11.2: Some smoothing kernels often used in smoothed particle hydrodynamics. For simplicity, we 
have used W (r, a) = K{r/a), and k is the normalization constant enforcing (11.81c). 


a spatial averaging function W as follows: 


N 

A( r ) = "Y2 Aj—^-W(r — rj, o). (11.80) 

3=1 Pj 

The particular - choice of will be discussed below. Let us for the moment concentrate on the function 
W. This function is known as an averaging function or smoothing kernel , and o is some parameter relating 
to its width. For the interpolation to be useful, we will have to require a number of constraints on W: 


W(r,a ) > 0, 

W(r,cr) = W(—r,a), 


I W(r ) dr = 1, 


(11.81a) 

(11.81b) 

(11.81c) 


which in words are positivity, rotational symmetry, and unit integral. Positivity is necessary in order to 
ensure that it is an averaging function. Rotational symmetry is often useful to ensure invariance w.r.t. 
rotations of the coordinate system, and unit integral ensures that maxima and minima are not enhanced. 

Many kernels have been proposed in the literature, some of the more popular - are shown in Table 11.2. 
In [Hongbin et al., 2005] these have been compared for stable fields. A stable field is a flow field that 
does not vary with space and the comparison has been performed w.r.t. the kernel’s estimation error of 
the function and first-order derivatives. It is concluded that the Gaussian and the Q-spline are the best 
kernels, and since the Gaussian is the kernel introducing the least structure [Weickert et al., 1997], in the 
remainder of this chapter, we will only consider the Gaussian kernel. The isotropic Gaussian kernel in n 
dimensions is given by 


(27TCT 


T ex P 
2^2 


— r T r 
~2o^~ 


G{r ,cr) 


(11.82) 
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using a > 0. It may be verified, that the Gaussian fulfills the constraints in (11.81). The Gaussian kernel is 
infinitely differentiable, and since it is the only term in (11.80) that depends directly on space, we conclude 
that (11.80) is also infinitely differentiable. That is, the partial derivative of A w.r.t. space is given as 


d r 


N 


dx a dyPdz~ / 


A ( r ) = Pi 


rrij 


gn 


3 =1 


Pj dx a dyPdz^ 


G(r - r 


3i 


cr 


(11.83) 


where n = a + (3 + 7 . In Figure 11.14 is a graph of a one-dimensional Gaussian together with its first- 
order derivative. The Gaussian is special in many respects: the Gaussian maximizes uncertainty when only 
mean value and standard deviation are known [Shannon et al., 1949], and it is the kernel that introduces 
the least structure when varying its scale, <7, [Weickert et al., 1997], Since a may be seen as proportional 
to downsampling of the domain with a factor <7, we will refer to <7 as the scale of the kernel and hence of 
(11.80). 

It should be noted that the Gaussian only fulfills (11.81c) on a continuous, infinite domain such as M 3 
or a 3D torus. Flowever, approximations may be used for discrete, finite size domains by sampling the 
Gaussian and renormalizing the Gaussian to enforce (11.81c). This will be assumed in the following. 

Now let’s return to the factor -y- appealing in (11.80). By using pj = -pf, where Vj is the volume 

implicitly attributed to the particle j, we find that '-y- = Vj. Furthermore, consider the interpolated density 
function. Using (11.80) with p instead of A we find 


N 

p(r) = ^2pj— w i r ~ r j,°) (11.84a) 

3=1 Pj 

N 

= ^2 m jW(r — rj,o). (11.84b) 

3 =1 

For a finite domain of volume V and properly renormalized Gaussian, then W — > y as a —> 00 , hence 
p = y m j, which is the correct density at infinite scale. This is certainly the correct density, when 



“book” — 2005/9/30 — 15:44 — page 394 — #406 


394 


CHAPTER 11. COMPUTATIONAL FLUID DYNAMICS 


viewing the system from infinitely afar. Likewise, when a —► 0 then G —► d(\\r — r. ; ||), where <5 is the 
Dirac delta function and defined as 


oo, if x = 0, 

0, otherwise, 

I S(x) = 1 

This implies that that the density is infinite at each particle, which although physically unrealistic, cor¬ 
responds to assigning the mathematical mass rrij to an infinitely small volume of space, which naturally 
would result in infinite density. We thus conclude that for the density function, the coefficients arc 
collect. 

Now consider the more complicated example of an incompressible, homogeneous Navier-Stokes equa¬ 
tion for an ideal gas and including a temperature gradient. In the Eulerian view, these equations arc as 
follows: 



(11.85a) 

(11.85b) 


8U . 1 r—r 1 } „ 

— = —(u ■ V)u -Vp + vV~u + F, 

at p 

(11.86a) 

V • u = 0, 

(11.86b) 

P)' j ' 

— = AV 2 T - V • (Tu) 
dt v ' 

(11.86c) 

pV = nRT 

(11.86d) 


However, smoothed particle hydrodynamics is based on the Lagrangian view, where particles arc treated 
rather than densities. Therefore, we should not use the normal derivative instead of the substantiate deriva¬ 
tive, which simplifies (11.86a) to 


1 9 

a, = —Vp + vYJ^u + F, 
P 


(11.87) 


Further, and in contrast to (11.86b), conservation of mass is simply obtained by neither removing nor 
introducing particles. 

According to (11.80) and (11.83) the gradient of pressure is calculated as, 


—Vp(r) 


N 


J2 p j ~T VW (r-r j ,t 

3=1 Pj 


( 11 . 88 ) 


First, note that due to (11.81b) W is an even function, which implies that the derivatives along each 
axis is an even function, hence antisymmetric, and VW(0, a) = 0. Nevertheless, (11.88) is not without 
problems. Consider a two particle system distance dr = r i — r-2 apart with identical densities and mass, 
p = f>\ = f >2 and m = rn\ = m 2 . Using (11.88) we find the force from the pressure gradient at each 
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particle to be 


—Vp(ri) = — p 2 — VW(dr, a), 

P 

771 

—Vp(r 2 ) = — pi — \7W(—dr, a) 
P 

= pi—VW(dr, a), 

P 


(11.89a) 

(11.89b) 

(11.89c) 


where we have used the antisymmetry of the gradient of W in (11.89c). Thus, when p \ / p 2 , the 
magnitude of the forces differ on each particle, and the system therefore does not fulfill Newton’s first 
law. 

To enforce Newton’s first law for the pressure gradient, a simple symmetrization has been suggested 
[Muller et al., 2003], where the smoothed pressure gradient is calculated as 

N _ 

-Vp(ri) = ~^2 Pl Pjmj VW(ri - rj,a). (11.90) 

j=i Pj 

For the two-particle system above, this results in a symmetric force on each particle 

-Vp(ri) = + V2 — VW(dr,a) (11.91a) 

2 p 

= 'Vp(r 2 ). (11.91b) 

Similarly, the smoothed viscosity is also antisymmetric for two-particle systems 

N 

v\7 2 v(r) = V 2 W(r — rj,a). (11.92) 

3 =1 Pj 


In this case, it has suggested to symmetrize by considering velocity differences [Muller et al., 2003] 

N 

i'V 2 v(r.i) = v'Y 2 ( v j ~ Vi)^-\ 7 2 W(r — rj,a). (11.93) 

3 =1 Pj 

Finally, the surface of a smoothed particle hydrodynamical system may be expressed as the surface 
where the density (11.84) is given as some user-specified constant c as 

S = {r | p(r) = c}, (11.94) 

and the surface may be tessellated, for example, using marching cubes [Lorensen et al., 1987] or splatting 
[Rusinkiewicz et al., 2000]. The gradient of the surface is useful for visualization, and it may be found as 
the gradient of the density take where p = c, that is, 


n = {Vp | p(r) = c}. 


(11.95) 
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Finally, the smoothed particle hydrodynamics may be augmented with a surface tension term 

F surface =r/mi (H. 96) 

where r is the surface tension coefficient, and h is the mean curvature, which is readily obtained as 
[Florack, 1997, p. 167], 

J 1 (Py + Pz)pxx + (Px 3 " Pz)Pyy 3 “ (Px 3 “ Py)Pzz — ^PxPyPxy ~ 2 PxPzPxz ~ 2 PyPzPyz 

‘~2 Tl+pf+TIF 5 


(11.97) 
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Figure III. 1: The simulation loop. 


During the course of physics-based animation, objects come in contact with each other. The contact 
region must be resolved so that objects bounce or slide off each other or come to rest upon each other. 
In order to simulate these effects it is necessary to infer details about the shape of the contact regions. 
This is the aim of collision detection in physics-based animation. Collision detection is therefore a purely 
geometrical problem. 

The simulation loop is the simplest way of describing the interaction between a simulation method and 
the collision detection. The simulation loop [Mirtich et al., 1995] illustrates the basic interaction between 
collision detection and the simulation method as shown in Figure III. 1. 

First, the simulation method updates the position of the objects and/or their shape, then the collision 
detection finds the regions of contact or penetration between the objects, and finally the regions of contact 
are reacted to in a proper physical manner by the simulator. 

In physics-based animation, collision detection often becomes the bottleneck, since a collision query 
needs to be performed in every simulation step in order to determine contacting and colliding objects. 
Animations can have many objects, all of which may have a complex geometry, such as polygonal soups of 
several thousands facets, and it is therefore a computationally heavy burden to perform collision detection, 
and it can be difficult to obtain real-time interaction. 

Hubbard [Hubbard, 1993] was among the first to address these specific kinds of problems encountered 
in physics-based animation. Initially, collision detection appeal's to be an 0{n 2 ) algorithm, since any two 
triangles may be intersecting. Instead of running an expensive pairwise collision test between every pair 
of objects, Hubbard introduced the concepts of broad-phase and narrow-phase collision detection still 
widely in use today. The main idea is to reduce the computational load by performing a coarse test in 
order to prune an unnecessary pair test. This is broad-phase collision detection. 

The actual reduced pairwise testing is the narrow-phase collision detection. After the narrow-phase 
collision detection, a postprocessing phase, called contact determination [Bouma et al., 1993], is usually 
carried out. Its purpose is to take the output from the narrow-phase collision detection and turn it into 
meaningful data for the simulation method. It is possible to further extend the collision detection with 
a fourth phase, termed the spatial-temporal coherence phase. The spatial-temporal coherence phase is 
essentially responsible for exploiting cached information and breaking down the computation in smaller 
independent pieces [Erleben, 2005]. 


© 


© 


© 


© 






“book” — 2005/9/30 — 15:44 — page 400 — #412 


400 


COLLISION DETECTION 



Figure III.2: The three phases of collision detection. 


Part IV is dedicated to the three phases: broad-phase collision detection, narrow-phase collision de¬ 
tection, and contact determination. These arc depicted in Figure III.2. Due to space considerations we 
will limit ourselves to treat only polygonal objects. Paid IV is organized as follows: in Chapter 12 we will 
introduce the reader to basic principles, and treat broad-phase collision detection in detail. In Chapter 13 
we will introduce the reader to narrow-phase collision detection algorithms, and in Chapter 14 we discuss 
contact determination. The remaining chapters, Chapter 15, Chapter 16, and Chapter 17 are dedicated to 
specific treatment of various narrow-phase collision detection algorithms. 
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It is computationally very expensive to test two arbitrary objects for collision and to determine their contact 
regions. As a simple example, imagine a scene consisting of two complex tentacle objects, like the ones 
shown in Figure 12.1(a). In this case, each object has roughly 10,000 faces. A naive approach would 
be to clip every face from an object against the faces of the other object. Such an algorithm would, in 
the worst-case, have a quadratic running time, 0(rn 2 ), where m is number of faces; this brute approach 
is therefore seldom used in practice. A better approach is to store an object in a spatial data structure 
as described in Chapter 15, which often brings the worst-case time complexity down to that of doing a 
complete traversal of a treelike structure, 0(rn log m). Even better would be to use a signed distance map 
approach as described in Section 17.1, which would consist of looking up every vertex of one object in a 
distance field of the other object. This requires 0(m), assuming there is an upper bound on the number 
of vertices in a single face, as often is the case for triangular meshes. In special cases for rigid and convex 
objects an expected 0(1) time complexity can be achieved. To summarize, for two arbitrary deforming 
and moving tentacle objects with a total of m faces, the worst-case time complexity ranges from 0(m) to 
0(m 2 ). Since m can become quite large, it is easily seen that in the general case, the pairwise testing will 
be computationally intensive. Doing physics-based animation with only two objects is often not enough, 
and even simple scenes often have a handful of objects, as shown in Figure 12.1(b). This is called the 
n-body problem , n being the number of objects. 

We can deal with the n-body problem by considering every possible pairwise test of the n objects in 
the scene. In the worst-case this would require 

( 12 . 1 ) 

pairwise tests. However, in practice it is rare that all objects are momentarily in contact with each other. 
For instance, a single sphere would expect, on average, to be in contact with at most six other spheres in a 
box full of rigid spheres, and in such a case we would expect only 0(6n) pairwise test to result in actual 
contact being detected. 

Since objects in physics-based animation is prohibited from deeply interpenetrating each other, we 
can, on average, expect to do better than 0(n 2 ) pairwise testing, and one of the goals of broad-phase 
collision detection is to quickly prune expensive and unnecessary pairwise tests. It is important to observe 
that broad-phase collision detection does not change the worst-case time complexity; nevertheless it does 
ease the workload. 

Figure 12.2 shows how a broad-phase collision detection algorithm can be used in a simple colli¬ 
sion detection engine. In order for a broad-phase collision detection algorithm to be efficient, we stride 
toward an algorithm capable of pruning an unnecessary pair test in constant time and with a very low 
time constant. From computational geometry [Berg et al., 1997, O’Rourke, 1998], we know traditional 
algorithms, such as: point location and range searching, as illustrated in Figure 12.3. These algorithms 
efficiently solve problems, such as 
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(a) 2 objects, 11066 faces each (b) 20 objects = 190 pairs 

Figure 12.1: Illustration showing why collision detection is computationally heavy. 

algorithm collision detection(C:Contacts) 
pairs = broad-phase() 

clear C for each pairp in pairs do 
proximity = narrow-phase( p) 
contact = contact-determination(proximity) 
add contact to C 
next p 

end algorithm 


Figure 12.2: A simple collision detection engine. 

• Which box contains a given point? 

• Which boxes overlap a given box? 

This seems to be useful for broad-phase collision detection algorithm. Therefore, an initial idea would be 
to approximate each object by a box, and then apply the traditional computational geometry algorithm. 
Unfortunately, such an approach is doomed to be inefficient in the context of physics-based animation, 
since two important things have escaped our attention: we need to consider all possible pairs of overlap, 
and these need to be considered as time progresses forward in a simulation. In conclusion, in broad-phase 
collision detection we want to solve problems, such as 

• Which boxes overlap each other? 

• Which boxes overlap in the next iteration? 

These arc also known as multiple queries and all-pair problems. 
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k-D Tree 


(a) Point Location Data Structure: line end-points are 
used to create horizontal lines, creating an 
arrangement of trapezoids. 


(b) Range Searching: a recursive subdivision of 
space is done by splitting space into two halves at 
every point. 


Figure 12.3: Traditional algorithms from computational geometry. 


In physics-based animation, objects move and deform over time, e.g., in Figure 12.4 frames from 
a simple rigid body simulation arc shown, and the broad-phase collision detection algorithm should be 
run at each of these frames. To perform fast simulation, we need an algorithm with low complexity 
and low constants, such as dynamic algorithms, which we’ll discuss later. In the following, we will 
introduce the four principles used for constructing a dynamic algorithm, followed by an outline of three 
popular algorithms for doing broad-phase collision detection: Exhaustive Search, also known as the All- 
Pair Test, Coordinate Sorting, also known as Sweep and Prune, and Hierarchical Hash tables, also known 
as Multilevel Grids. 

12.1 The Four Principles for Dynamic Algorithms 

To obtain fast broad-phase collision detection algorithms the following four principles arc helpful: 

• The Approximation Principle 

• The Locality Principle 
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(c) 2.43 sec (d) 2.71 sec 

Figure 12.4: Frames of a rigid body simulation. 
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Figure 12.5: The Approximation Principle. A complex tentacle geometry is approximated by an AABB, 
a sphere, an OBB, and a cylinder. 


• The Coherence Principle 

• The Kinematic Principle 

The Approximation Principle Complex geometry of an object can be approximated with a simpler 
geometry, which is known as the Approximation Principle principle!approximation. Some frequently 
used approximation geometries arc: 

• Spheres 

• Axes Aligned Bounding Boxes {AABB) 

• Oriented Bounding Boxes {OBB) 

• Cylinders 

These arc illustrated in Figure 12.5. The choice of approximation geometry has an impact on the time 
constants of a broad-phase algorithm, and the general rule of thumb is that more complex geometry means 
higher time constants. 

Here, complexity is used vaguely; the number of parameters needed to describe the size and location 
of a volume is one definition: a sphere needs four parameters, an AABB needs six parameters, a cylin¬ 
der needs eight parameters, and an OBB needs nine parameters. However, there are other definitions, 
for example, describing the shape by the order of curved surfaces or the number of faces. The theory 
of shape complexity is huge, and the interested reader may wish to investigate Stochastic Complexity 
[Rissanen, 1989] and Fractal Dimensions [Mandelbrot, 2004, Halsey et al., 1986] 

Typically, a scene consists of many objects in the order of 100-1000 and for a dynamic, real-time 
animation, we will have to approximate many objects frequently. This indicates that it should be easy 
and fast to find the approximating geometry. In contrast, more complex geometry means better pruning 
of unnecessary pair tests. The choice of tradeoff is thus si mi lar to the choice of volume type in bounding 
volume hierarchies, as discussed in Chapter 15. However, we know of no work examining this trade-off 
for broad-phase collision detection. We believe that simpler geometries will be preferable, since a more 
complex test is done later by the narrow-phase collision detection. In the broad-phase collision detection 
algorithms we have chosen to present in this book, we have decided to use AABBs because: 
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Figure 12.6: Sides of AABB arc orthogonal to the coordinate frame, whose axes arc shown with arrows. 

• They arc easy and fast to compute given a maximum distance from the center of mass of a body to 
any point on the body surface 

• They arc fast to do an overlap test with 

• They generalize to any dimension 

This is also true for spheres. However, later we will present two algorithms: sweep and prune and hierar¬ 
chical hash tables, that only work with AABBs. The exhaustive search algorithm, which we will present 
can use any kind of volume. In practice, as a rule of thumb, one should choose the simplest volume that 
can be used with the chosen algorithm and which best describes the shape of the objects. As an example, 
if one chooses the exhaustive search algorithm and has sphere-like objects, then a sphere volume is going 
to be the best choice. We will now go into the details of an AABB. 

An AABB is a box whose sides arc orthogonal to the world coordinate system (Wes') axes as illustrated 
in Figure 12.6. Sometimes other coordinate frames arc chosen, e.g., the model frame in which the object 
was modeled, however, for the purpose of broad-phase collision detection, we will only consider the WCS. 

We can represent an AABB by two points: the minimum point r mm and the maximum point r max . 
These are defined as the comers of the AABB with minimum and maximum coordinates, as illustrated in 
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Figure 12.7: Minimum and maximum points of an AABB. 


Figure 12.7. 

A brute-force approach for computing the minimum and maximum point is to iterate over every face 
vertex and determine the minimum and maximum coordinates as illustrated in Figure 12.8. This will result 
in the tightest-fitting AABB with the best pruning capabilities. Unfortunately, this approach is infeasible 
for objects with many vertices as the ones shown in Figure 12.1(a). The brute method is sometimes 
the only alternative; for instance, if an object is deforming and there is no way to predict the bounds of 
deformation. However, sometimes, faster algorithms that compute a looser fitting AABB can be used, e.g., 
if the objects arc rigid bodies, then a looser fitting AABB method can be constructed by first determining 
an enclosing sphere, and then finding the AABB enclosing the sphere. 

In order to find an enclosing sphere, we will introduce a preprocessing step where we determine the 
maximum distance, d max , from the center of mass to the surface of an object, A 


dmax = max (||p|| 2 , p<EA). 


( 12 . 2 ) 


This is illustrated in Figure 12.9. For a polyhedron object, d max can be computed by iterating over all 
the vertices of the object. For each vertex, the difference between its position and the center of mass is 
computed, and then it is tested if the length of the difference vector exceeds the previously known distance 
to the surface. The observant reader might object that this iteration is more expensive than the iteration 
used to compute a tight AABB, but recall that the computation is done only once as a preprocessing step, 
whereas the tight AABB computation is performed in each and every query. The pseudocode for this algo¬ 
rithm is shown in Figure 12.10. For an object represented by splines or NURBS, a similar strategy could 
be used, by exploiting the convex hull property of their control points. Implicit surface representations are 
more difficult, but their surface could be sampled [Witkin et al., 1994]. 
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algorithm bruteAABB(M: Mesh) 
min = (infty, infty, infty) 
max = min 

for each vertex v in M do 
if v.x < min.x then 
min.x = v.x 
end if 

if v.x > max.x then 
max.x = v.x 
end if 

next v 

return min and max 
end algorithm 


Figure 12.8: Brute-force computation of an AABB. 



Figure 12.9: Maximum radius to surface of an object. 
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algorithm compute-dmaa: (O : Object) 
V = vertices(0) 

dmax — 0 

for each v £ Fdo 

if dmax < ||v — r cm || 2 then 
dmax — ||tt r cm || 2 
next v 

end algorithm 


Figure 12.10: Maximum radius computation for a rigid polyhedron object. 


Knowing d max . the minimum point is computed a: 

T min — **cm 

and the maximum point is given as 

f'max — f'cm T 

Observe that since d max is precomputed once, finding the AABB is a constant time operation with ex¬ 
tremely low constants, since it only requires a total of six simple floating-point operations. An example 
AABB is shown in Figure 12.11; notice that although the AABB is not the best fit, it is very fast to 
compute. 

If we look at two boxes i and j along a coordinate axis of the WCS, then we can describe the boxes 
on the axis by two intervals 

[b t and [b 3 ... e,], (12.5) 

where 

bi < ej and bj < ej. (12.6) 

These AABB intervals are illustrated in Figure 12.12. Observe that the intervals can be found by projecting 
the minimum point and the maximum point, i.e., the b and e-values onto the axis in question. The 6s and 
es are also sometimes called the endpoints. Clearly if box i and box j overlap along a given axis, then we 
must have 

bj < bi < ej or 6* < bj < e t . (12.7) 

If they are separated, then we must have 


5 


(12.3) 


(12.4) 


e* < bj or ej < 6p (12.8) 

Exactly one of these four conditions is always true, and the other three arc always false, as shown in 
Figure 12.13. Furthermore, if, and only if, their intervals along all coordinates axes overlap, only then 
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Figure 12.11: An example of an A ABB computed by the fast computation method. 


will the two AABBs overlap. As a consequence, in 3 dimensions the problem has been reduced to three 
one-dimensional problems: we simply count the number of overlaps along each of the WCS axes. If the 
number of overlapping axes is equal to three, then we conclude that the two AABBs overlap. Figure 12.14 
shows the pseudocode for this overlap test. In contrast, for objects approximated with spheres, a distance 
computation in 3 D space would be needed. Thus, in the case of spheres, one cannot exploit the property 
of dimension reduction of the AABBs. Comparing the actual costs of sphere testing and AABB testing, 
using squared distances spheres required 4 multiplications, 6 additions, and 1 comparison, where AABBs 
arc handled with only 12 comparisons and 3 logical tests. Thus, spheres arc marginally more expensive 
when considering just a single pairwise test. 

The Locality Principle Another key observation for broad-phase collision detection algorithms is that 
only close neighbors need to be considered when determining overlaps between the approximating geome¬ 
tries. This is termed the Locality Principle. This principle essentially states that objects lying sufficiently 
far - away from each other do not need to be considered as overlapping and can easily be ignored. 

The idea is perhaps best illustrated by the example as shown in Figure 12.15. Here 28 objects arc 
placed in a world, posing (28 • 27)/2 = 378 possible pair test. Many of these arc trivially superfluous, for 
instance, the leftmost object is so far - away from the rightmost object that it would be unnecessary to do 
a pairwise test between the two objects. If we draw arrows between the center of masses of the objects, 
which arc close neighbors, then the potential set of pairwise tests drop to only 42 in the present case. 
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algorithm AABB-overlap (i, j : AABB) 


if ( b jx 

< bi x 

< e ix 

or b ix 

< bj x 

< ei x ) 

i and 

< K 

< bi y 

< e Jy 

or b iy 

< bjy 

< ei y ) 

and 

( b jz 

< bi z 

< ej z 

or b iz 

< bjz 


then 


return true 
return false 


end algorithm 


Figure 12.14: Pseudocode for overlap test between two AABBs. 



Figure 12.15: The Locality Principle. Total number of object pairs is far greater than the number of edges 
between nearby objects. These indicate possible object pairs where it make sense to test for collision. 


yielding a speedup factor of 378/42 = 9. 

In order to exploit the locality principle to its fullest, one needs to come up with a way to deter¬ 
mine when things arc sufficiently far away. Computing a generalized Voronoi diagram [O’Rourke, 1998, 
Berg et al., 1997] from scratch in order to determine the close neighbors would be much too expensive; 
therefore, other methods arc needed. In Section 12.3 and Section 12.4 we will show two examples where 
coordinate sorting and gridding is used to exploit the locality principle. 

The Coherence Principle The third principle is the Coherence Principle, and by coherence we mean a 
measure of how much things change. Low coherence would indicate a configuration, where objects arc 
nearly teleporting around in the world, and high coherence would be the exact opposite, where objects 
move smoothly around. 

The rigid bodies in a rigid body simulator have high coherence, if the time-step is not too big. In 
Figure 12.16, an example is shown where a number of rigid bodies is falling under gravity with a time- 
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step of 0.3 seconds. Observe how object positions and orientations only change a little bit from frame to 
frame. This is an example of coherence. 

Coherence comes in many flavors, four different kinds of coherence arc usually considered in the 
context of physics-based animation. They arc: 

• Spatial Coherence 

• Geometric Coherence 

• Frame Coherence 

• Temporal Coherence 

The first two address the shape and position of objects in the world, where high coherence would indicate 
that objects arc placed nicely. In the following, we will think of this as a measure of how nicely objects arc 
aligned, and how much overlap they pose. The last two kinds of coherence arc indications of how objects 
evolve, that is, how they move and deform as the simulation proceeds. 

High coherence indicates that we would have to compute almost the exact same result in the next 
iteration of our simulation method, which suggests that we can reuse previously computed results from 
the last iteration to obtain the correct results for the current iteration. This is what we mean by the phrase 
dynamic algorithm, not to be confused with dynamic programming. 

The Kinematics Principle When we can benefit from knowing something about the objects trajectory 
in the near future, we say that we exploit the Kinematics Principle, which is the fourth principle. This 
comes in handy when proximity information about near future colliding objects is needed. Figure 12.17 
shows an example of where knowledge of the trajectory could be used to predict a collision between the 
red sphere and the green box and between the red and blue spheres. The knowledge could be used to 
minimize the number of times we actually have to stop the simulation and do collision detection. In the 
case of the example, collision detection only needs to be performed twice: once at each time of impact. 
This is, for instance, useful in order to obtain accurate information for computing Time of Impact ( TOI ) 
estimates. Another example of usage is when four-dimensional collision detection is used, such as space- 
time bounds (S'-bounds). 

12.2 Exhaustive Search 

An exhaustive search is the brute-force approach for doing broad-phase collision detection. Brute-force is 
testing the AABBs of every possible object pair against each other. For this reason, it is also commonly 
called the all-pair test. The time complexity is therefore 

(;)==*=f^»0(n»). (12.9) 

A pseudocode version of the algorithm is presented in Figure 12.18 and as can be seen from the pseu¬ 
docode, exhaustive search only exploits the principle of approximation. At first this algorithm might seem 
to be a poor choice due to its quadratic running time. However, it does pose some nice features: 
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(c) 


(d) 


Figure 12.16: High coherence in a rigid body simulation. Even though objects have moved a lot as seen 
globally, they arc more or less still placed similarly as seen relative with respect to each other. 






^jsK 


Figure 12.17: The Kinematics Principle. Considering future trajectories of the spheres, it is possible to 
predict possible future collisions. 

algorithm exhaustive-search (B : bodies) 
for each b £ B do 
compute AABB of b 
next b 

for each (6, b')b^b' £ B do 

if AABB(b) overlap AABB(b') then 
report overlap of (6, b') 
next (b, b') 
end algorithm 


Figure 12.18: Pseudocode for the exhaustive search algorithm. 


• It is simple and fast to implement 

• It has extremely low time constants (when using AABBs) 

• It is applicable to any kind of bounding volume geometry 

This means that it is practical for small-sized configurations and its simplicity further ensures that it is 
unlikely to be the point of failure. Other more advanced algorithms require a bit of hard work to implement 
and debug. For configurations up to sizes of order 10 — 100 objects it is our experience that Exhaustive 
Search outperforms both Sweep and Prune and Hierarchical Hash tables, as discussed in the following. 





b 1 b 2 e 1 b 3 e 2 b 4 e 3 e 4 b 6 b 5 e 6 e 5 b 7 e 7 axe 


Figure 12.19: AABBs on a single coordinate axis. 

12.3 Sweep and Prune 

It is possible to obtain a far better time complexity than the Exhaustive Search algorithm. In this section 
we will outline an algorithm based on a sweep line approach: it has a linear expected running time and is 
called Sweep and Prune or coordinate sorting [Baraff et al., 2003a, Lin et al., 1994]. 

We will begin with introducing the algorithm in the ID case, where the AABBs simply are bounded 
intervals on the coordinate axis. The situation is depicted in Figure 12.19. First, all the interval endpoints 
arc sorted in increasing order along the coordinate axis, assuming for now that no two endpoints have 
the same coordinate value. The idea is to let a sweep line go from minus infinity to infinity along the 
coordinate axis, and update a status set, S, with every endpoint encountered. It is the purpose of this 
status set to keep indices of those objects whose interval is currently crossing the sweep line as illustrated 
in Figure 12.20. The update rule of the status set when the sweep line hits an interval endpoint is: 

• If the sweep-line hits a starting endpoint, i.e., 6,, then object i is added to the status set S. 

• If the sweep-line hits an ending endpoint, i.e., e- t , then object i is removed from the status set S. 

It is immediately observed that the status set indicates which AABBs overlap along the coordinate axis, 
and we can report all overlapping AABBs on the coordinate axis by extending the two update rules with 
an extra auxiliary rule: 

• When object i is about to be added to S, we report overlaps between i and all other objects currently 
in S. 

The extension to higher dimensions is straightforward. We simply perform a sweep along each coor¬ 
dinate axis; afterward we count the number of times the same overlap has been seen. If we count the 
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Sweep Line 



Figure 12.20: The sweep line and the status set. 


same number of overlaps as the number of dimensions, we immediately know that the two corresponding 
AABBs overlap. This implies that all overlaps found on one coordinate axis should be matched against 
all overlaps found on the other axis, which is a quadratic algorithm in the number of overlaps. However, 
by introducing a counter table and modifying the update rules of the sweep-line slightly, we can get rid of 
the unpleasant quadratic time matching. The modification is as follows (for three dimensions): 

• Use a table of counters, where each counter keeps track of the number of coordinate axes two 
AABBs i and j overlap on. 

• When body i is about to be added to S we increment the counters between i and every other body 
currently in S. 

• If a counter between two bodies reaches the value 3, then we report an overlap between the two 
bodies. 

We now have Sweep and Prune in its most basic and pure form. If the endpoints arc sorted along the 
coordinate axis, it runs in linear time in the number of AABBs, which is very attractive. However, there 
is nothing that guarantees that the endpoints will be sorted upon invocation of the Sweep and Prune 
algorithm, and the algorithm therefore requires a presorting along ah coordinate axes indicating that the 
computational complexity is O(nlogn) [Cormen et ah, 1994], where n is the number of AABBs. This 
is still better than the quadratic 0(n 2 ) running time of Exhaustive Search, but we can do even better by 
taking a close look at the sorting of the endpoints. 

Sweep and Prune requires that a list is sorted in increasing order. There are several sorting algorithms 
[Cormen et ah, 1994] available for doing this: quicksort, bubblesort, insertion sort, etc. We will use 
insertion sort ; later it will become clear why. For now, we will summarize how insertion sort works. For 
more details, we refer to [Cormen et al., 1994]: 

• Insertion sort scans the unsorted list from left to right 

• When a number is encountered, which is smaller than its leftmost neighbor, then insertion sort: 




















(a) First mark (b) Swap (c) Next mark (d) Second swap 


Figure 12.21: Example of insertion sort. 

1. Marks the position to the right of the number in the list. 

2. The number is then repeatedly swapped with its leftmost neighbor until the number is greater 
than or equal to the leftmost neighbor. 

3. The original scan is continued from the marked position. 

An example of how insertion works is shown in Figure 12.21. The worst-case time complexity of insertion 
sort is 0(n 2 ). This occurs if numbers in the initial unsorted list are sorted in decreasing order from left to 
right, in which case every number encountered in the scan is swapped to the front of the list, that is, 

0 + l + 2 + 3 + -- - + (n-l) = n(n ~ ^ « 0(n 2 ). (12.10) 

'--' 2 

n- terms 

If the list is nearly sorted in increasing order then insertion sort behaves much better. Only a few numbers 
need to be swapped and they need not be swapped a very long way. Therefore, in this case, insertion sort 
runs in expected 0(n) time. This indicates that depending on how badly sorted the endpoints arc, we can 
expect Sweep and Prune to have a 1 inear time complexity and a quadratic time complexity as lower and 
upper limits. 

Fortunately in the context of physics-based animation, we can expect a high coherence, which means 
that the intervals of the AABBs along the coordinate axis only change slightly from invocation to invo¬ 
cation of the Sweep and Prune algorithm. If we save the sorted intervals from the previous invocation, 
and use these for the input to the current invocation of Sweep and Prune algorithm, then high coherence 
implies that insertion sort can sort them in linear time, Oin). We have now found a broad-phase collision 
detection algorithm which runs in expected Oin) time. 

Nevertheless, there arc still situations where Sweep and Prune has a quadratic running time as shown 
in Figure 12.22. Here a 2D rigid body simulation is shown, where five rigid balls arc falling under gravity 
along the y-axis and colliding with a fixed object. The balls arc only slightly displaced, such that at the 
frame after the impact, their order along the y-axis is reversed. In this case, insertion sort is passed a list 
of interval endpoints along the y-axis, which is sorted in reverse order. Fuckily, in our experience this is 
more a theoretical issue than a practical issue. 

It is possible to optimize the Sweep and Prune algorithm even more. Inspired by how the interval 
endpoint list was reused from the previous invocation of the algorithm, we will now try to reuse the list of 
reported overlaps from the previous invocation as well. For this purpose we will extend our counter table 
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Figure 12.22: Bad case for the Sweep and Prune algorithm resulting in quadratic running time. 

to also include a pointer for a counter into the list of reported overlaps, if an overlap is reported for that 
counter. Now let us examine what happens during the swapping operation of insertion sort. 

• Imagine insertion sort is swapping b r to the left of e ; , that is previously e } < bi but now e-j > b L . 
This indicates that i and j arc now overlapping, so their counter is incremented and if its value is 
three, an overlap is reported. 

• The opposite might also occur if t } is swapped to the left of bi , meaning that i and j no longer 
overlap. This time the counter between i and j is decremented and if a reported overlap exists, then 
it is removed in constant time by using the overlap pointer between i and j. 

• If insertion sort tries to swap b t and bj , then we do nothing. Similarly, we do nothing if it tries to 
swap e t and e ? . 

With these modifications, we no longer need to have the sweep line and the status set represented explicitly. 
The update actions of the sweep line is performed implicitly, when insertion sort performs a swap between 
two endpoints. The pseudocode for sorting one coordinate axis in the Sweep and Prune algorithm can be 
seen in Figure 12.23. 

So far we have assumed that no two endpoints have the same coordinate value along a given coordinate 
axis. If this happens, the algorithm will not report an overlap. In practice this can have fatal consequences. 
A missing overlap means that no contacts are detected further down the collision detection pipeline, thus 
the simulator might fail to react on a touching contact. An example of such a simulation is shown in 
Figure 12.24. We will now deal with this degenerate case: if we apply the rule that b’ s are to come before 
e’s, then the inner while loop in Figure 12.23 becomes 

While scan < left or ( scan = left and left=eand scan =b) Do 

The implication of this is that touching AABBs will also be reported. This is useful, since the surface of 
the enclosing object could be touching the surface of the enclosing AABB. 

























algorithm coordinate-sort-axis (L : List (endpoints)) 
scan = head(L) 


while L unsorted do 

mark = right(scan),left = left(scan) 
while scan < left do 

if scan = bi and left = ej then 
increment counter (i,j) 
if counter (i,j) =3 then 
report overlap (i,j) 
else if scan = ej and left = bi then 
decrement counter (i,j) 
if counter (i,j) =2 then 
remove overlap (i,j) 
end if 

scan = left,left = left(scan) 
end while 
scan = mark 
end while 
end algorithm 



Figure 12.24: Illustration that missing detection of overlaps in the broad-phase collision detection algo¬ 
rithm can result in bad simulation behavior. A resting stack of rigid spheres in a gravitational field with 
their AABBs superimposed. Simulation frames arc shown from left to right, reported overlaps and detec¬ 
tion of contacts only occur at places marked with an arrow. Observe that penetrations occur due to the 
lack of collision resolving. 
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Figure 12.25: A single-level grid. 


12.4 Hierarchical Hash Tables 


Another approach to broad-phase collision detection is to use a spatial data structure, such as a rectilinear 
grid. Mirtich [Mirtich, 1996] introduced a multilevel gridding algorithm, known as hierarchical hash 
tables, which will be explained in this section. 

Consider the simulated world divided into a rectilineal - grid using the same size grid spacing along all 
coordinate axis. Such a spatial subdivision is called a tiling. The bounded region, where the simulation 
is expected to take place, can be represented by a minimum and maximum point in a manner similar to 
an AABB. A resolution of the cells can then be given as the edge size of the grid cells as illustrated in 
Figure 12.25 for a 2D example. If we have a tiling and a given cell resolution, p, every cell can be uniquely 
identified by a triplet ( a , /3, 7 ) in 3D. The triplet consist of set of order indices indicating the order of a 
cell along the respective coordinate axis. The idea of triplets is illustrated for the 2D case in Figure 12.26. 
These triplets uniquely identify a given cell in the tiling and can be used as hash keys. Given an arbitrary 
point, p, we can find the triplet of the enclosing cell in the tiling as 


t(p,p) 



{a, P, 7 ) • 


( 12 . 11 ) 


Here t is the minimum point of the minimum cell. 

Using (12.11), all cells in the tiling that overlap with a given AABB are easily found by identifying the 
triplets spanned by the mapping of the minimum, p min , and maximum point, p max , of the given AABB, 
that is, all triplets satisfying the relation 


T {Pmin:P ) < 7 ) < T(p max ,p). 


( 12 . 12 ) 


The idea is illustrated in the 2D case on Figure 12.27. If every cell can store the AABBs that it overlaps 
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algorithm store (b : AABB) 

for each c where r(p min , p)<c< r(p max ,p) do 
for each b' £ c do 

counter (6, b') = counter (b, b ') +1 
if counter (b, b') =1 then 
report overlap (6, b') 
next b' 
add b to c 
next c 

end algorithm 


Figure 12.28: Pseudocode for storing AABB in single-level grid. 


with, then we have a data structure we can use for broad-phase collision detection. All we need is a table 
of counters, where each counter remembers the number of cells where two AABBs, i and j, both were 
mapped to. 

If the counter has value zero, then no cell exists where both i and j were mapped to. If a counter has 
value one, then there exists exactly one cell, where both i and j have been mapped to. If the counter has 
value two, then there exists two different cells, and so on. 

In the Exhaustive Search and Sweep and Prune algorithms we required the actual AABBs to overlap; 
this is not necessary for gridding algorithms. Sometimes a gridding algorithm reports AABBs as being 
overlapping, if there exists at least one cell in the tiling where they both arc mapped to. In this case, 
overlapping means that the approximate geometry (i.e., the AABBs) of the objects lie sufficiently close. 
Of course one could extend such a gridding algorithm with an extra test, which examines for overlap with 
approximate geometry. Therefore, in the remainder of this section we say overlapping whenever we mean 
that there exists a cell where two AABBs arc both mapped to. 

This indicates a way to report AABBs lying sufficiently close to each other: when an AABB is mapped 
into the tiling, the counters between the newly added AABB and those already stored in a cell is incre¬ 
mented by one. If during this operation we encounter a counter that is incremented to the value one, then 
it is the first time this overlap between the two AABBs is seen. The pseudocode for storing an AABB in 
the grid is shown in Figure 12.28. 

The next time the single-level gridding algorithm is invoked we must start by dealing the tiling, which 
may be done by removing AABB in a similar way to how they were added. The difference is that this 
time the counters arc decremented, and when they reach a value of zero any reported overlap is removed. 
The pseudocode for the removal operation is shown in Figure 12.29. 

With the remove and store operations, we arc now capable of using the tiling for doing broad-phase 
collision detection, simply by iterating over all AABBs. For each AABB, we first remove it from the 
tiling using its position from the last invocation. Then we update the position of the AABB for the current 
invocation before finally storing the AABB in the tiling. The pseudocode for the algorithm is shown in 
Figure 12.30. Notice that we need to know both the old and new position of the AABBs. 

It is possible to apply some optimization to the gridding algorithm just described. Clearly a lot of 
remapping is performed by the repeated removal and addition of an AABB. If we have high coherence, it 
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algorithm remove (b : AABB) 

for each c where r(p min , p) <c< T{p max ,p) do 
remove b from c 
for each b' £ c do 

counter (fe, b ) = counter (b,b ) - 1 
if counter (b, b') =0 then 

remove reported overlap b,b') 
next b' 
next c 

end algorithm 


Figure 12.29: Pseudocode for removing AABB from a single-level grid. 


algorithm single-level (B : Bodies) 
for each b £ B do 
remove (AABB(b)) 
compute new position of b 
store (AABB{b)) 
next b 

end algorithm 


Figure 12.30: Single-level gridding algorithm for broad-phase collision detection. 


is very likely that an AABB is remapped to the same cells in the tiling, and unnecessary computations arc 
thus taking place. We can remedy this by exploiting cached information from the previous invocation of 
the gridding algorithm. If we cache the location of the minimum and maximum points of the AABB from 
the previous invocation, then we can perform a test in the current invocation to see whether the current 
positions of the minimum and maximum points of the AABB maps to the same triplets as in the previous 
invocation. We only need to remap an AABB if the minimum and maximum points map to different triples 
than they did in the previous invocation. 

The simple gridding algorithm we have outlined so far; is not without problems. If p is too big, then 
the algorithm is no better than Exhaustive Search, since every AABB will be mapped to the same triplet. If 
p is too small, then we will spend a lot of time on mapping AABBs into the tiling. To avoid this we would 
like the cells to be of the same size as the AABBs of the objects. However, the size of the objects can vary 
greatly, so it is impossible to decide a good value of p that would work for all objects. In conclusion, the 
single-level gridding algorithm is a good choice when all objects arc of similar size. 

For objects varying greatly in size we can use multiple levels in the gridding algorithm. Assume that 
we have multiple tilings, for instance, m different tilings, each with its own resolution, and assume that 
the fth tiling has the resolution denoted by pi. Furthermore, assume that all the tilings can be sorted 
according to their resolutions, such that 

0 < p m < Pm-1 < Pm-1 < • • • < P2 < Pi- (12.13) 
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We define the size of an AABB, B to mean 

size (73) — max (Vmax x 'f'min x i^'max y min y i max z • (12.14) 

We would like to have some means to describe how well B fits into a given tiling, say the j’th tiling. In 
order to do so we will introduce two constants a and (3 as lower and upper bound of the fitness. We require 
that there exist a resolution pj, such that 

sizei B ) 

a <--—- < (3 and 0 < a < 1 < /?, (12.15) 

Pj 

is fulfilled. By picking specific values for a and / 3, we can control our sense of fitness. If a = 0.5 and 
(3 = 1.0 in (12.15), then it means that B must have a length that is from 0.5 to 1.0 times the width of the 
cells at resolution p } . 

Naturally, not all our resolutions will fulfill this constraint for a given AABB, and there might even be 
more than one resolution fulfilling the constraint. In order to define a unique best-fitting resolution for a 
given AABB, we pick the largest resolution fulfilling the constraint. That is, for B we have 

res(B) = max ( pk I a < ——-—- < (3 ) , (12.16) 

V Pk J 

which is the maximum value of p fulfilling the constraint. The largest resolution is picked such that if 
B was mapped into the corresponding tiling, then we will map B to fewer cells than if we had picked a 
smaller resolution fulfilling the constraint. The semantics can be quite confusing, however, we adopt the 
convention that 

• Higher resolution (larger value of j in pj) means smaller cell size (i.e. smaller p value) 

• Lower resolution (smaller value of j in pj) means larger cell size (i.e. larger p value) 

Figure 12.31 shows a ID example of the definitions introduced so far. In the example, tilings of different 
resolutions arc drawn on top of each other. The thick lines indicate the p-values of the tilings. When we 
work with multiple levels, we will store (and remove) an AABB in all levels with a resolution that is lower 
(having larger p-value) than or equal to the resolution of the AABB itself. Figure 12.32 illustrates how 
an AABB, A, is mapped into the ID multilevel tiling shown in Figure 12.31. The filled box indicates the 
resolution of A, and as can be seen, A has the resolution of the tiling with a cell size si mi lar to the size of 
A. 

The reporting of overlaps is handled a little differently from the case of the single-level algorithm 
introduced previously: the counter between two AABBs, i and j, is only updated at resolution k if 

Pk = max res (6,;), res(6j). (12.17) 

This condition essentially masks out the tiling having the best resolution for detecting an overlap between 
i and j. The condition should be obvious, since we cannot use a minimum resolution value because only 
one of the AABBs is mapped at this level. The maximum resolution value is the first level, where both 
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Figure 12.31: ID multilevel gridding, showing correspondence between resolution definitions. 



Figure 12.32: ID multilevel gridding, showing the mapping of an AABB 
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AABBs arc mapped into, and therefore the level with the best resolution for detecting overlap between the 
two AABBs. 

Figure 12.33(a) shows an example where two AABBs, A and B, arc mapped into the multilevel grid. 
In Figure 12.33(b) is another AABB, C, about to be mapped into the tilings. We leave it to the reader as an 
exercise to determine the proper counter values. The pseudocode for the multilevel gridding algorithm is 
shown in Figure 12.34, the multilevel store operation is shown in Figure 12.35, and the multilevel remove 
operation is shown in Figure 12.36. The time complexity for updating the multilevel gridding algorithm 
is lineai; 0(n), since the number of levels are limited and the a-(3 constraint equation gives an upper limit 
on the number of cells that overlap an AABB, such that it takes constant time to store or remove an AABB 
from the multilevel grid. Since we have n AABBs, it must take lineai' time to update the multilevel grid. 
However, the total time complexity depends on the maximum number of AABBs mapped into a single 
cell. If we denote this number by m, then the time complexity of the entire algorithm is 0(n + m 2 ). In 
practice, the number of tilings and their resolutions, together with the a and (3 constants are chosen such 
that m 2 n, and such that the quadratic term can be ignored. 

There are some additional benefits of using multilevel gridding. The time complexity is independent 
on the configuration. We saw in the case of Sweep and Prune, that for some configurations, one could 
expect the worst-case quadratic time complexity. The multilevel gridding algorithm does not suffer from 
this weakness. Another advantage is that coherence does not change the time complexity, it only reduces 
the workload, while Sweep and Prune was highly dependent on high coherence in order to perform well. 

There are drawbacks, however, of this algorithm as well: in order for multilevel gridding to be efficient 
in practice, one needs a good hash table implementation, which is not as common as list and tree data 
structures. The worst drawback is that the algorithm may require a huge memory usage, in particular 
if parameters are chosen poorly. In comparison. Exhaustive Search does not use any memory at all, 
and the Sweep and Prune algorithm only needs the counter table, which in worst-case is 0(n 2 ), but in 
practice, often is Q(n), and the interval endpoint lists, which is 0{n). The multilevel gridding algorithm 
also needs a counter table as Sweep and Prune, however the multiple tilings can consume quite a lot of 
memory: letting d be the dimension, m the number of levels, and p the maximum resolution value, then 
the memory consumption of this spatial subdivision scale in the worst-case is 0( n ^ L ). In our experience, 
multilevel gridding is competitive with Sweep and Prune for medium-sized configurations, as the memory 
usage grows large for the multilevel gridding algorithm page swapping degrades performance and Sweep 
and Prune becomes superior. 


12.5 Using the Kinematics Principle 

In the three broad-phase collision detection algorithms we have presented, we have used the approxima¬ 
tion principle and the coherence and locality principles, but none of the algorithms have exploited the 
kinematics principle. In this section we will present some ideas for how this last principle can be put to 
use. 

Sweeping volumes [Mirtich, 1996] is one way of incorporating knowledge about the future. These 
come in many different variations, but they all share the same properties: 

• They are volumes in 3D space. 
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(b) Box C is mapped 


Figure 12.33: Updating of counter value in a multilevel grid algorithm. 
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algorithm multilevel( B : Bodies) 
for each b € B do 
p — res{AABB(b )) 

for each tiling t where res(t) < p do 
remove (AABB(b),t) 
next t 

compute new position of b 
for each tiling t where res(t) < p do 
store (AABB(b)) 
next t 
next b 

end algorithm 


Figure 12.34: The multilevel gridding algorithm. 


algorithm store (b : AABB,T : Tiling) 

for each c € C where r(p min ,p T ) < c < r(p max ,p T ) do 
for each b 1 £ c do 

if pr = max (res(b), res(b')) then 

counter (6, b') = counter (6, F) +1 
if counter (b, b') =1 then 
report overlap# b') 
end if 
next b' 
add Hoc 
next c 

end algorithm 


Figure 12.35: The multilevel gridding store operation. 


algorithm remove ( b : AABB,T : Tiling) 

for each c <= C where r(p min , p T ) < c < r(p max ,p T ) do 
remove b from c 
for each B' £c do 

if pr = max (res(b), res{b')) then 

counter (# b' ) = counter (6, b ') - 1 
if counter (b, b') =0 then 

remove reported over lap6, b') 
end if 
next b' 
next c 

end algorithm 


Figure 12.36: The multilevel gridding remove operation. 
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• They represent the space an object travels through in a given time duration t — t cur . 

• They arc often not tight, meaning they cover more space than the objects travel through; however, 
they arc always an upper bound. 


Here we will derive a sweeping volume, which is an AABB, but our idea is not specific for AABBs and 
could be applied to other volume types as well. From kinematics, we know something about the trajectory 
of an object: 

dVcm d V n'n't 

“ cm = = ~d^- (2 - 18) 

In order to find a formula for the trajectory of an object, we will try to integrate the acceleration to find 
r cm as a function of time. If we assume that the acceleration is constant in the near future, then 


d 2 1 


dt 2 


= a r 


d 2 i 


Integrating once more, w.r.t. time yields 

dr r 


dt 


dt 2 
dv cm 
dt 


-dt = 


dt = 


dcm dt. 


— Vcm T 


(^cm H~ Q-cm/) dt , 


T cm (t) — V cm + V cm t + 0 CL cm t . 


(12.19a) 

(12.19b) 

(12.19c) 

(12.20a) 

(12.20b) 


We thus have a function describing how the center of mass moves in the near future assuming that it moves 
with constant acceleration. In the limit of very small time steps (t —> t cur ), this is a fair assumption, and 
the trajectory computed will be a close approximation to the true trajectory of the object. Convergence 
theory indicates that the approximation will be better and better the smaller time-steps we use. In the 
theoretical limit the approximation will be exact. Therefore, the basic idea is to find the extreme points 
of the trajectory along the coordinate axes of WCS and then place an AABB enclosing the object at 
these extreme points. This is done by first placing smaller AABBs around each of the extreme positions 
enclosing the object at these positions. Afterward, one large AABB is found enclosing all the smaller ones. 
This final large AABB is computed as the sweeping volume of the object. This seems straightforward; we 
only need to figure out: 


• How do we find the extreme points? 

• How do we find an enclosing AABB of a set of smaller AABBs? 


We observe that the trajectory is described by a second-order polynomial, so its curve is a parabola. If we 
assume that gravity is the only force acting on the object, then a cm is pointing downward (i.e., [0,-1,0] T ). 
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Figure 12.37: An AABB sweeping volume example. Sweeping volume is computed as the AABB cover¬ 
ing the AABBs of object in initial, top, and final position. 


The extreme points will then be the stalling and ending point of the parabola and the top point. The large 
enclosing AABB is found by taking the minimum and maximum coordinates of all the smaller AABBs. 
The construction algorithm is shown in Figure 12.37. 

Space-time bounds [Cameron, 1990] also describe how an object moves in the near future, but they 
arc very different from sweeping volumes. They represent the time dimension, hence for a 2D object the 
space-time bound will be a 3D volume, and for a 3D object the space-time bound will be a 4D volume. 
We usually work in 3D and thinking of 4D volumes might be a little difficult, but the benefits arc 

• Even though sweeping volumes intersect, the space-time bounds might not. 

• With space-time bounds we can find the exact time of the intersection. 

In conclusion, the extra dimension give us more detail of the movement. 
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Introduction to Narrow-Phase Collision De¬ 
tection 


There is a large number of narrow-phase collision detection algorithms, far too many to lit into a single 
chapter in this book. Therefore, this short chapter serves as a brief introduction to concepts and goals in 
narrow-phase collision detection, which we need to know before talking about contact determination in 
Chapter 14. Later, in Chapter 15, Chapter 16, and Chapter 17 we will treat different groups of narrow- 
phase collision detection algorithms in depth. 

Broad-phase collision detection lists pairs of potential colliding objects, and the narrow-phase collision 
detection examines each of these pairs in detail. For each pair of objects, the narrow-phase collision 
detection determines whether the objects arc separated, touching, or penetrating. Most existing algorithms 
for narrow-phase collision detection arc capable of returning much more detailed information, such as: 

_ • separation distance _ 

• penetration depth 

• separation axis 

• closest points 

• closest features 

The choice of the right narrow-phase collision detection algorithm depends on the later stages in the 
animation pipeline ; some examples arc: 

• closest points can be used in contact determination 

• separation distance/axis may be needed for the computation of Time of Impact ( TOI ) 

• penetration depth is required for calculating contact forces and backtracking 

There is a wealth of literature on narrow-phase collision detection algorithms, and many differ¬ 
ent approaches have been investigated. Spatial subdivision algorithms, like Binary Space-Partitioning 
(BSP) tree [Melax, 2001], octree [Tzafestas et al., 1996, Ganovelli et al., 2000, Erleben et al., 2003b], k- 
d trees, and grids [Ganovelli et al., 2000, Erleben et al., 2003b]; feature-based algorithms, like polygonal 
intersection [Moore et al., 1988], Lin-Canny [Ponamgi et al., 1997], V-Clip [Mirtich, 1998b], and SWIFT 
[Ehmann et al., 2001]; recursive search methods, like [Sundaraj et al., 2000] and simplex-based, like GJK 
[Gilbert et al., 1988, Bergen, 2001], Lately, volume-based methods, such as generalized Voronoi dia¬ 
grams [Hoff, III et al., 1999] and signed distance maps [Guendelman et al., 2003, Bridson et al., 2003, 
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Hirota, 2002] have become popular. Finally, there arc algorithms based on bounding volume hierarchies. 
It is beyond the scope of this chapter to explain all of these algorithms; we refer the reader to Chapter 15, 
Chapter 16, and Chapter 17 for more details. 

Narrow-phase collision detection algorithms arc naturally divided into four main groups: 

• Spatial Data Structures such as spatial subdivisions and bounding volume hierarchies 

• feature-based 

• simplex-based 

• volume-based 

In the spatial subdivisions, we approach the problem of collision detection by building a data structure 
containing the geometry. The data structure is built by subdividing the object into pieces, for instance, by 
clipping the object recursively by planes. Bounding volume hierarchies arc very similar to this, but while 
spatial subdivisions do not respect the underlying features of the geometry, bounding volume hierarchies 
do. As an example, a subdivision algorithm might clip a face in a polygonal mesh into two pieces, each 
stored in different locations of the data structure, while a bounding volume hierarchy never does this. It 
will treat each face as an unsplitable atomic entity. More detail is found in Chapter 15. 

A feature-based algorithm works directly on the features of an object. For polygonal meshes, the 
features arc the vertices, edges, and faces. A simple feature-based approach would clip every face of one 
mesh against all the faces of the other mesh and vice versa. If the clipped result ends up being empty, then 
no penetration of the objects has occurred, but if the result is not empty, then there will be a penetrating 
volume of the objects. Feature-based algorithms arc discussed in detail in Chapter 16. 

A simplex-based algorithm is based on a simplex, which is the convex hull of an affinely inde¬ 
pendent set of points. We will not go into further details here, but instead, we refer the reader to 
[Gilbert et al., 1988, Cameron, 1997, Bergen, 1999, Bergen, 2001, Bergen, 2003b]. Simplex-based algo¬ 
rithms often work by incrementally improving upon a simplex, which approximates the configuration 
space of the objects. The subject has a mathematical nature, which tends to discourage some people. 

Volume-based algorithms have a volumetric view of the objects. An example could be the sampling 
of the signed distance function of an object onto a 3D rectilineal - grid. In such a case, the closest points 
and separation distances from a given location from the object can be found by looking up values and 
gradients in the signed distance grid. These kind of algorithms are rather new compared to the spatial data 
structures, feature-based, and simplex-based algorithms. Volume-based algorithms will be discussed in 
Chapter 17. 

The major challenges for narrow-phase collision detection are deforming geometry, self-intersections, 
and penetration depths. For deforming geometry, the spatial data structures need to be updated in ev¬ 
ery invocation of the narrow-phase collision detection algorithm, which is time consuming. Simplex- 
and feature-based algorithms usually assume objects are convex, which might not hold after a deforma¬ 
tion. Volume-based methods pose some of the same problems for deforming geometry as the spatial data 
structures, e.g., a deformation would require an update of the signed distance map. Self-intersections 
are unpleasant by nature, which is seen by an example: for a polygonal mesh, every face is touching its 
neighboring faces, indicating a collision, but not a self-intersection. Search and traversal methods pose 
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difficulties in effectively pruning away unnecessary tests. Existing algorithms for calculating penetration 
depths arc extremely slow. 

In summary, narrow-phase collision detection algorithms solve the problem of determining whether 
two objects intersect (intersection query ) or how far apart they arc {proximity query ). Narrow-phase 
collision detection algorithms often need to be superseded by a contact determination algorithm, which is 
the subject of the next chapter. 
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Contact Determination 


The task of computing regions of contact from the output of the narrow-phase collision detection algorithm 
is called contact determination, and it is a pure geometrical problem. Before we can present algorithms 
for contact determination, it is important to define what we mean by a region of contact and how it should 
be represented. 

The reader should be familiar with concepts and goals of narrow-phase collision detection as described 
in Chapter 13. Detailed knowledge of specific narrow-phase collision detection algorithms is not required 
for reading this chapter. 

14.1 Contact Regions, Normals, and Planes 

A touching point between the surfaces of two objects is called a contact point. In practice however, 
this is not always a suitable representation. Quite often a simulation method needs information about 
the neighborhood of a touching point, i.e., the shape. At the point of contact of two smooth surfaces, the 
surfaces will have coinciding tangent planes, and these tangent planes serve as a good linear approximation 
to the two surfaces at the point of contact. We call such a tangent plane the plane of contact, and its normal 
the contact normal. 

In a computer world, we work with polygonal objects and we cannot always take the myopic view of 
the real world. Polygonal objects arc inherently discontinuous over the edges and vertices. Therefore, 
a slightly different approach is often used to represent the neighborhood information for contact points 
between polygonal objects. The contact point is represented by two features, one from each polyhedra, 
and the point of contact is usually computed as the closest point between the two features. Such a rep¬ 
resentation is called a Principal Contact (PC). For polyhedron objects we have three different kinds of 
features: a vertex, V, an edge, E, and a face, F. Due to symmetry, we only have six different types of 
principal contacts: ( V ., V), (V. E ), ( V. F), (F. E ), {E, F), and (F, F ). Figure 14.1 illustrates the six dif¬ 
ferent principal contacts. Observe that the closest point is not uniquely defined in all cases, and that some 
types of principal contacts represent entire continuous regions of infinitely many contact points. These 
problems will be deferred for later treatment in this section. 

Since objects arc represented by features; it seems natural to use them for representing contact points. 
It is quite easy to deduce information about the shape of the surface in a neighborhood of a single touching 
point. For instance, in the case of an (F, V) contact, the contact plane is obviously picked unambiguously 
as the plane of the face. Finally, it is easier to recognize the same contact point evolving over time (i.e., a 
temporal touching point) based on a pair of features rather than a 3D world location, because the features 
will temporarily be the same pair by coherence, whereas the 3D world location could change substantially. 

The finite precision of floating point representation introduces some problems in connection to touch¬ 
ing contacts: intuitively we define a touching contact as the point between two objects where the distance 
is exactly zero. However, numerical imprecision will make such a distance slightly nonzero, typically in 


437 


“book” — 2005/9/30 — 15:44 — page 438 — #450 


438 


CHAPTER 14. CONTACT DETERMINATION 




(a) (1/, V) 


(b) (V, E) 





(d) (E,E) 


(e) ( E,F) 



Figure 14.1: The six different principal contact types. 


the range of 10 _ '-10 _15 , depending on the precision of computation. To help reduce such problems, we 
introduce a collision envelope 

around each object, as illustrated in Figure 14.2. The collision envelope works as a slightly soft 
enlargement of the objects, defining a held around the object where it can consider touching contact with 
other objects. That is, whenever the distance between the closest points of a feature pair is less than 
the collision envelope, then the corresponding contact point is said to be a touching contact. The closest 
points between a feature pair using a collision envelope might no longer be exactly the same for a touching 
contact. To remedy, the point of contact, p, is usually computed as the mean of the closest points of the 
two features, p A and p B 


Pa + Pb 
2 


(14.1) 


Thus, given two features, one from object A and one from object B, we need to be able to compute the 
closest points between the features. In case of a (V,V) contact type, the closest points arc simply given 
by the vertex positions. In the case of an (E. V) contact type, we can project the position v of the vertex 
V onto the edge E running from position o to position d and having direction u 
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Figure 14.2: The collision envelope. 


where u = d — o. If the projection lies between the two positions o and d, then the projection indicates 
the closest point on the edge. If the projection lies outside the endpoints of the edge, then the closest 
endpoint to the projection is taken to be the closest point. Figure 14.3 illustrates the (E, V) contact type. 
In case of a (1/, F) contact type, we project the position, v, of the vertex V onto the face plane of the face 
F, represented by a face normal, n, and an arbitrary point, /, on the face, 

Pf = v - ((v - f) ■ n)n. (14.3) 

This is illustrated in Figure 14.4. If the projected vertex lies inside the face boundary, then its position 
gives the closest point on the face. If it lies outside, we can repeat the (V, E) case for each boundary edge 
of the face in order to find the closest edge and the closest point on this edge. 

The case of an {E\. E 2 ) contact type is a little more difficult. Assume that the closest points, p E and 
p ly2 , between the two edges arc given. Then 


and 

PE 1 =Ol + ((; p E2 ~ Oi) • Mi) Mi, 

(14.4) 


Pe 2 =o 2 + ((p El ~ 02 ) ■ m 2 ) m 2 . 

(14.5) 

Inserting 

equation (14.5) into equation (14.4) yields 



P El = °1 + {{°2 + {{P El - °-2) ■ U 2 ) M 2 - Oi) • Ml) Ml. 

(14.6) 

Isolating 

P El results in 

/(o 2 - 01 ) • (mi - ku 2 )\ 

(14.7) 


Pa = °i + ( )“■' 
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V 


Figure 14.3: Closest point between vertex and edge. 

V 



Figure 14.4: Projection of a vertex onto a face. 


where k = u \ ■ u 2 . Finally, we can find p Ei by applying the ( E. V) case. The two remaining cases of 
the (E , F) contact type and the (F. F) contact type can be handled by decomposing them into a set of the 
cases we already know how to handle. If the edge E runs between vertices O and D and the boundary 
edges of F is given by {(E, E\), (E, E 2 ),...}, then we can decompose the case (E , F ) as follows: 

(E, F ) - {(O, F), (D, F), (E, E,), (E, E 2 ),...} (14.8) 

If the boundary of the face F\ is defined by the edges { Eq , E \, E \,...} and the boundary of face F 2 by 
{Eq, E\ , E%, ■ ■ then we can decompose (F\. F 2 ) into 

(Fi,F 2 ) - {(. E 1 0 ,F 2 ), (El, F 2 ),(F\,Eq), (F\,E\),...}. (14.9) 

Actually, from this we can deduce that all contact points can be represented by three cases: (V, E), ( E , E), 
and ( V, F ), which is the smallest set of principal contact types we can use to represent contact points. The 
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Figure 14.5: Contact regions of polyhedra objects. 



Figure 14.6: Examples of contact geometry between polyhedral objects. 


observant reader will have noticed that the (V, V) case is not included. This is because the contact point 
could equally be represented by an (V, E ) case. We will later elaborate on this redundancy in contact 
point representation. 

We define the contact region as the intersection of the two objects. Figure 14.5 shows two polyhedra 
examples. Notice we can have multiple regions between two objects and regions arc not necessarily planar. 
For polyhedra objects, the contact region is a polygonal area, which consist of: 

• single points 

• line segments 

• or closed polygons (perhaps with holes) 

Figure 14.6 shows three examples of contact geometry between polyhedral objects. For convex polyhedra, 
the contact regions is particularly simple. It is either a single point, a single line segment, or a closed planar 
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Figure 14.7: Contact region represented by polygonal vertices of the intersection. 


convex polygon without holes. This indicates that we can represent contact regions by the vertices of the 
polygonal regions. It is easy to determine principal contacts for representing these polygonal vertices. The 
set of principal contacts used for representing the polygonal vertices is called a contact formation ( CF ). 
Figure 14.7 illustrates the idea. In the figure, the CF consists of the principal contacts PCPC 2 , PC 3 , 
and PC 4 . 

The usage of contact formations is much more general and can be used even in the case of penetration, 
as Figure 14.8 illustrates. In the figure, the CF could consist of the principal contacts, 

PC\ ( E A ,F B ), 

PC 2 ( E A ,F B ), 

PC's ( E A ,F B ), 

PC4 (E A ,F B ). 

It is apparent that the definition of a contact formation gives a valuable tool for representing contact 
regions, but unfortunately the representation of a CF is not unique. Figure 14.9 shows an example where 
a single (V, F ) or four (E, F) contact points representing the same contact region. 

When objects touch in real life, the contact normals are parallel to surface normals at the point of 
contact, but this is not true for polyhedra objects as Figure 14.10 shows. Two methods for computing 
contact normals can be used. We may pick the normal of the feature with the highest dimension, and 
in case both features have the same dimension, we make a random pick. Alternatively, we can compute 




“book” — 2005/9/30 — 15:44 — page 443 — #455 



14.1 PHYSICS-BASED ANIMATION 


443 



“book” — 2005/9/30 — 15:44 — page 444 — #456 


444 


CHAPTER 14. CONTACT DETERMINATION 



Figure 14.10: Contact normals for polyhedra arc not easily defined. At the point of contact, the normal of 
the triangle is nonparallel with the normal of the square. 



Figure 14.11: A discontinuous change of contact normal. A box slides from left to right, when the left 
lower corners of the box reach the tip of the Y-shape, normals arc picked randomly from the (V. V) 
contacts. 


the normal as a unit vector along the direction of the closest points between a pair of features. The first 
method suffers from discontinuity of contact normals, meaning that as a simulation proceeds, the direction 
of the contact normal can change abruptly even when the motion is smooth, as illustrated in Figure 14.11. 
In the figure, a box slides smoothly across a supporting object. In the first frame all normals are pointing 
downward (we have ( F. V) contact types), but at the instant the box reaches the state shown in the last 
frame, the contact normals on the left side of the supporting object would be picked randomly from the 
(Y, V) contact type. None of these normals points in the same direction as the face normals previously 
used. One remedy is to exploit caching of contact normals from previous frames to avoid abrupt changes 
of the normals. However, this solution comes with a bookkeeping cost. In practice, the problem is often 
ignored. The argument being that these types of contact often have a short lifespan so an end user will 
never notice the problem in a real-time simulation. The second method for computing the contact normal 
obviously breaks down in cases where the closest distance of the features is zero. In such cases, one often 
resorts to using the first method for computing contact normals. 

Knowing the contact point, p, and the contact normal, n, of a touching contact, we can define a contact 
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Figure 14.12: A thick contact plane. 


plane as the set of all points, x, that fulfills the equation of a plane 

{x | (x — p) ■ n = 0} . (14.10) 

Observe that for convex polyhedra, all touching contacts lie in the same plane, when no penetration occurs. 

In this section we have given definitions of principal contacts and contact formations. We have dis¬ 
cussed difficulties in computing contact points and contact normals; we have also treated the uniqueness 
problem of contact formations. In Section 14.2 and Section 14.3 we will present two different algorithms 
for computing the contact formation between two objects. The algorithms make use of the definitions and 
concept introduced so far. 


14.2 A Geometrical Algorithm 

Assume that we are given the two closest features and two convex polyhedra, for instance, by V-Clip (see 
Section 16.3) or GJK 

[Gilbert et ah, 1988, Cameron, 1997, Bergen, 1999, Bergen, 2001, Bergen, 2003b]. If they arc within 
the collision envelope, then we have a touching contact between the convex polyhedra. We will now 
develop a simple algorithm to determine the CF. In the first step of the algorithm 

• the two features will be the first principal contact in our CF 

• the touching principal contact will be used to determine a contact plane Q 

Since we arc using a collision envelope, we will make Q a thick plane as shown in Figure 14.12. Because 
the objects arc convex, we know that all other features, which might cause a touching contact, must lie 
within Q. So we start at the initial feature of object A and examine features in a breadth first traversal over 
the mesh of object A. When a feature is found lying inside the thick plane Q, it is added to the impact 
zone , Za, of object A. The impact zone is thus the set of touching features. If a feature is encountered 
lying outside Q, then the neighbors arc not considered for further traversal. That is, we start at the first 
feature and examine neighboring features recursively. If all points of a feature, Xa, are inside Q, then we 
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Figure 14.13: Projected impact zones for both objects. 


add feature to impact zone, Z a- Afterward, the same procedure is repeated for object B and we will have 
two sets of features, Za and Zj>. 

If we project the impact zones Za and Zb onto (the thin) Q, then we have reduced our problem of 
finding principal contacts to a two-dimensional problem. An example of how the projected impact zones 
could look like is shown in Figure 14.13. One way to think of these impact zones is as the print pattern 
of a wet soccer ball on the ground. The projected pattern corresponds to features; principal contacts can 
therefore be determined by examining how the two patterns intersect each other. 

To examine the intersection of the patterns, we first test projected edges from Za for intersection 
against projected edges of Zb, features that we considered to be open point sets. If an intersection is 
found, we add the corresponding pair of edge features as a contact point to the CF. Figure 14.14 illustrates 
the edge-edge case. Afterward, we test projected vertices from Za against projected faces of Zb- If 
a vertex is found to lie inside the boundary of a face, then a corresponding (V, F) contact is added to 
the contact formation. Hereafter, the roles of A and B arc interchanged as illustrated in Figure 14.16. 
Similarly, we test projected vertices against projected edges. Figure 14.17 illustrates this step for our 
example. In the final step, we test projected vertices against projected vertices. In the example, there is 
none of these kind of intersections. Looking at all the intersecting projected features we can build the 
contact formation for the touching convex objects. 

The algorithm we have presented is fairly simple, and it is quite easy to implement. The concept of 
using the thickening plane is a great way to combat numerical imprecision, and it is our experience that it 
works well in practice. If objects become highly tessellated, then the algorithm will return many principal 
contacts in the CF lying inside the polygonal intersection. It is possible to add a further postprocessing 
step eliminating these principal contacts by computing the convex hull. The observant reader might argue 
that the algorithm is ineffective due to the pairwise testing of projected features, which would lead to an 
quadratic time complexity. However, the number of projected features is usually low, and the quadratic 
time complexity will therefore be insignificant compared to the time consumed by the narrow-phase col¬ 
lision detection algorithm. In case the number of projected features grows exceedingly large, a sweep line 
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Figure 14.14: Edge intersections of the projected impact zones. 


or spatial data structure approach could be used to reduce the quadratic time complexity. 

14.3 A Contact Tracking Algorithm 

Now we will look at an entirely different kind of contact determination algorithm, called contact tracking 
[Mirtich, 1998a]. The algorithm is applicable to arbitrary shaped polyhedra, but only usable if a retroactive 
detection is used, that is, a backtracking time-control algorithm. We will assume that the narrow-phase 
collision detection is capable of returning pairs of intersecting features as is the case for bounding volume 
hierarchies (BVHs) discussed in Chapter 15. 

A backtracking algorithm means that the time control allows for penetrations to occur; when this 
happens, the simulator is rewound to a state where penetrations disappear - . If a contact point is missing, 
then the simulation method will not be able to apply contact forces and/or contact impulses where they 
are needed. As a consequence, penetrations will occur at these points, when the simulation is run forward. 
This means that when we detect penetrating features, they actually represent a missing contact point. 
Therefore, we can add a penetrating pair of features to the set of potential principal contacts in the CF. 

After we have added potential principal contacts, we let the time control backtrack the simulation, and 
next time the contact determination algorithm is invoked, we validate the set of potential principal contacts 
to form the actual CF used to compute the contact forces and/or impulses. 

When we validate the potential principal contacts, we require that their closest points are within the 
collision envelope and that the closest points must be within the corresponding external Voronoi regions 
of the features. Voronoi regions are discussed in Chapter 16. All principal contacts not fulfilling these 
requirements are removed from the CF. A 2D example of the algorithm is shown in Figure 14.18. 

One problem with the contact-tracking algorithm is that we can get redundant principal contacts. For 
instance, if an edge penetrates a face, then both the edge and the endpoint lying below the face could be 
used to represent the principal contact. We could compute the actual contact points of all principal contacts 
and remove those with identical coordinates, but it would be better to use the concept of the smallest set 
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Figure 14.15: Intersection test of projected vertices of A against projected faces of B. 


of principal contact types. 

The contact tracking algorithm appeal's to be much more ad hoc than the geometrical approach, and 
it is impossible to assert anything about its time complexity because it is highly intervened with the 
backtracking of the simulator. However, the method is easily implemented and works well in practice. 
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Figure 14.16: Intersection test of projected vertices of B against projected faces of A. 



Figure 14.17: Intersection test of projected vertices against projected edges. 
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Figure 14.18: 2D example of contact tracking algorithm. 
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Bounding Volume Hierarchies 


There arc two types of spatial data structures: spatial subdivision and bounding volume hierarchies ( BVH ). 
Spatial subdivision recursively partitions the embedded space of an object, whereas bounding volume hi¬ 
erarchies arc based on a recursive partitioning of the primitives of an object as stated in [Gottschalk, 2000], 

Using spatial partitioning has the disadvantage that splitting of polygons is usually unavoidable. This 
may increase the depth of the tree data structure, which is undesirable for performance reasons. Besides, 
the cells produced by the partitioning tend not to cover the objects tightly, thus making the cells less 
suitable for determining contact status when models arc close together. This is a disadvantage in physics- 
based animation where contact status needs to be determined very efficiently. Due to these problems 
with spatial partitioning and space considerations in this book, we will not treat spatial partitioning here. 
Instead we will focus on an in-depth treatment of bounding volume hierarchies. 

Bounding volume hierarchies arc popular and widely used in computer graphics for e.g. ray tracing 
[Goldsmith et ah, 1987], and in animation for e.g. cloth animation [Bridson et ah, 2003], since they arc 
applicable to a more general shape than most feature-based and simplex-based algorithms. They tend to 
generate smaller hierarchies than spatial subdivision algorithms and they offer a graceful degradation of 
objects, which is highly useful when accuracy is to be traded for performance. 

A bounding volume hierarchy (BVH) is a discrete representation of all levels of details. At the first 
level, we have a coarse representation with less-to-no detail, usually a single bounding volume 

(BV) of the object. At the next level, more detail of the object is represented, and we say we have 
a finer scale. This level is often constructed by splitting the original object into disjunct partitions and 
placing bounding volumes around each partitioning. The following levels reveal more details of the object 
in a recursive fashion, by splitting the geometry of each disjunct partition at the previous level into several 
disjunct subpartitions and placing a bounding volume around each of these subpartitions. The finest level 
of the representation—the finest scale—is the object primitives, such as lines, triangles, or tetrahedra. 

The successive scale refinement naturally introduces a parent-child relationship between bounding 
volumes at succeeding levels, with the topology of a tree; therefore, the data structure is called a bounding 
volume hierarchy. 

15.1 Tandem Traversal 

Tandem traversal is an algorithm for testing overlap between two BVHs. We will introduce tandem 
traversal with an example. Consider a 2D stick object, as shown in Figure 15.1. The sticks are referred 
to as the geometrical primitives or simply the primitives. In this example, we will use bounding circles, 
implying that at the first level we place a single bounding circle enclosing the entire stick object as shown 
in Figure 15.2. In order to obtain a more detailed version, the stick object is further divided into three 
disjunct partitions: the first partition is given by the two topmost sticks, the next partition is given by the 
two connected bottom-right sticks, and the last partition is given by the two leftover sticks. Enclosing 
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Figure 15.1: A 2D stick object; sticks arc label ,Sj,..., Sq. 



Figure 15.2: A single circle, a\, enclosing the 2D stick object shown in Figure 15.1. 

circles are placed around each of these partitions, as shown in Figure 15.3. Each of these three new 
circles is linked to the initial circle representing the entire object. Finally, each of the three disjunct 
partitions arc split into two disjunct partitions, and for our example this is the finest level, since each 
partition now contains a single stick. Enclosing circles arc placed around each of these partitions, as 
shown in Figure 15.4. The new circles arc linked to the parent circles from the previous level shown 
in Figure 15.3. The bounding volumes at the finest level arc called the leaves. Quite often the leaves arc 
annotated by the geometry they enclose, in which case the BVH is annotated. We can now draw the tree of 
the BVH we just constructed, as can be seen from Figure 15.5, using the labels introduced in Figure 15.1, 
Figure 15.2, Figure 15.3, and Figure 15.4. This is an example of an annotated BVH. Drawing all the levels 
superimposed on top of each other, as shown in Figure 15.6, we will gain more insight into how BVs at 
different levels relate to each other. Notice that a BV does not necessarily enclose its children BVs; only 
the geometry enclosed by its children needs to be strictly enclosed. Hence: 

Property 15.1 (BV Property) 

A Bounding Volume (BV) is defined such that: a BV at level i encloses the geometry of all its children at 
level i — 1. 
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Figure 15.3: Three circles: b\, 62 , and 63 enclosing three disjunct partitions of the 2D stick object as 
shown in Figure 15.1. Notice that circles might overlap; it is the underlying geometry that is partitioned 
into disjoint sets. 



Figure 15.4: Circles: C\, C 2 , C 3 , 64 , C 5 , and Cq at the finest level. Each enclosing a single stick of the 
object, as shown in Figure 15.1. 



Figure 15.5: Tree topology of BVH built in. 
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Figure 15.6: All bounding circles of the BVH shown in Figure 15.5 drawn on top of each other. 


It is important to notice that Property 15.1 only says something about how tight, but not how loose, a BV 
can be fitted to its children. For instance, it might be useful to build a BVH such that parent BVs enclose 
all the geometry of the children BVs. 

Now let us tty to see how we can use the BVH data structure. Imagine two identical and overlapping 
stick objects, as shown in Figure 15.7. In the following, they will be referred to as the left and right 
object. In order to determine whether the stick objects overlap, we will recursively examine their BVHs. 
Stalling at the two root bounding volumes, we see these overlaps, as shown in Figure 15.8. If the root 
bounding volumes had not been overlapping, we could immediately conclude that it would be impossible 
for the two objects to be colliding, using Property 15.1. Unfortunately, we cannot infer the opposite, and 



Figure 15.7: Two identical and overlapping stick objects, each is identical to the stick object in Figure 15.1. 
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Figure 15.8: Root BVs overlap, indicating possible collision at a liner level. 



Figure 15.9: Left root BV from Figure 15.8 has been replaced by its children. 


we must look for an answer at a lower scale. We do this by replacing one of the root volumes by its 
children, say the left one, and we will have the situation depicted in Figure 15.9. This is called descending 
because we arc picking a path to move down the hierarchy toward the leaves. The rule for picking which 
volume to descend along is called the traversal rule. As can be seen from the figure, we now have three 
circles from the left hierarchy, which we test for overlap against the root circle from the right hierarchy. 
Again, an overlap means that a collision might occur at the finest scale, and a separation means that no 
collision is possible. In our case, all three circles of the left hierarchy overlap with the root circle of 
the right hierarchy. To lower the chance of finding overlapping BVs, we replace the largest volume with 
its children, as shown in Figure 15.10. We now test each of the three right circles against the three left 
circles yielding a total of nine pairwise tests. Luckily only two of the nine pairwise tests arc overlapping. 



Figure 15.10: Right root BV from Figure 15.9 has been replaced by its children. 
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Figure 15.11: Overlapping circles from the right object in Figure 15.10 have been replaced by their chil¬ 
dren. 



Figure 15.12: Overlapping circles with largest volume in Figure 15.11 have been replaced by their chil¬ 
dren. 


indicating a collision at a finer scale. The remaining seven pairwise tests arc nonoverlapping and can thus 
be discarded from further consideration. This is called pruning. The two pairwise overlapping cases arc 
between circles of the same size, so we randomly choose to descend down the right hierarchy illustrated in 
Figure 15.11. Here the top circle from the left hierarchy is tested against the two topmost circles from the 
right hierarchy, and the bottom circle from the left hierarchy is tested against the two bottom most circles 
from the right hierarchy. The bottom circle from the left hierarchy and the bottom rightmost circle from 
the right hierarchy are nonoverlapping and arc therefore pruned. We pick the largest volume to descend 
along giving us the situation in Figure 15.12. At this finest level, only two BVs arc overlapping, the two 
topmost circles and the two bottommost circles, all other pairs of circles arc pruned. Having reached 
the leaves of the BVHs, the only way we can obtain more details about the collision between the two 
stick objects is by considering the annotated primitives. This can be done in two different ways. In the 
first method, we replace the BVs from one hierarchy by the enclosed primitives, and test these primitives 
against the BVs from the other hierarchy, as shown in Figure 15.13. If the pairwise primitive-BV test 
indicates overlap the remaining overlapping BVs arc replaced by their enclosed primitives, and we arc 
now left with pairwise primitive tests, as shown in Figure 15.14. From these pairwise primitive tests we 
can now return a precise description of the overlapping of the two stick objects. 
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Figure 15.13: Replacing overlapping leaf circles from right object in Figure 15.12 with annotated primi¬ 
tives. 



Figure 15.14: Replacing overlapping leaf circles from left object in Figure 15.13 with annotated primi¬ 
tives. 


The other method simply skips the step of testing primitives against BVs. This has the benefit of 
simplifying the implementation efforts, since one can skip the code for overlapping tests between BVs 
and primitives and only consider the two cases of pairwise BVs and pairwise primitives. Although the 
pairwise BV and primitive testing is often less expensive than the pairwise primitive testing and could 
lead to further going directly for the pairwise primitive testing, it means a reduction in the number of 
iterations arc needed. Therefore, there is a tradeoff between the number of iterations and the complexity 
in the overlap testing. 

The recursive collision query we performed in the example above is called a recursive tandem traversal 
because two hierarchies arc traversed simultaneously in a recursive manner. Figure 15.15 illustrates a 
pseudocode version of the collision query algorithm. In practice, it is often a bad choice to implement the 
recursive tandem traversal by using recursive calls [Gottschalk, 2000] because the number of primitives 
in an object can be quite large and the hierarchies for such large objects will be even larger. Hence, the 
number of recursive calls could potentially be very large, leading to a memory stack overflow on the 
memory heap. 

Fortunately the recursive traversal is easily rewritten into an iterative traversal by using a.first-in-first- 
out queue (FIFO): initially, the root bounding volumes arc pushed onto the queue, then a loop is set up, 
popping the two first pairs of volumes from the queue, and the overlapping test is performed as usual. 
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algorithm recursive (A : BV ., B : BV ) 
if not colliding (A, B) then 
return 

if leaf (A )and leaf(B) then 

contact-determination (A,B) 
continue 
end if 

if descend (A) then 

for all children C of A do 
recursive (C,B) 
next C 

if descend (B) then 

for all children C of B do 
recursive (A,C) 
next C 

end algorithm 


Figure 15.15: Recursive tandem traversal collision query. 


However, when the actual descend is about to be performed, instead of invoking a method recursively, the 
descended volumes arc pushed onto the queue and thus processed later in the loop. Figure 15.16 shows 
an iterative version of the pseudocode from Figure 15.15. Algorithmic variations can be applied to the 
traversal scheme. For instance, alternative traversal orders could be used: preorder, in-order, or post-order. 
The first corresponds to a breadth-first traversal, the last two arc similar to depth-first traversals. Observe 
that if a stack is used instead of a queue in the iterative traversal, then we will have a depth-first traversal 
instead of a breadth-first traversal. 

The idea of using a queue can be taken one step further by introducing a priority on the pairwise 
BV tests. This is useful, if one wants to perform a time-critical tandem traversal. In most cases there 
will be more than two objects in a world; a time-critical query would be to consider all pairs of colliding 
objects. The broad-phase collision detection, of course, returns all such pairs that need to be considered 
(see Chapter 12); therefore, all pairs of root BVs are initially pushed onto the queue and given a priority. 
Afterward, a loop is set up si mi lar to the pseudocode shown in Figure 15.16. The only difference is that 
an extra condition is added to the loop, testing whether more time is available for performing the collision 
query. Figure 15.17 outlines the pseudocode for a time-critical collision query. It is important to notice 
that all object pairs are handled simultaneously: since each BV pair pushed onto Q gets a priority, we will 
descend simultaneously through all BVHs according to whatever priority rule is applied. For more details, 
see [O’Sullivan et ah, 1999, Dingliana et ah, 2000]. 

15.2 Coordinate Space Updates 

We’ll examine some of the details in performing the collision queries in Figure 15.15, Figure 15.16, and 
Figure 15.17. We need to consider at least two kinds of coordinate spaces when we perform a query. 
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algorithm iterative (A : BV, B : BV ) 
Queue Q 
push (Q,A,B) 
while not Q empty do 
pop (Q,A,B) 

if not colliding (A, B) then 
continue 

if leaf (A )and leaf(B) then 

contact-determination (A,B) 
continue 
end if 

if descend(A) then 

for all children C of A do 
push (Q,C, B) 
next C 

if descend (B) then 

for all children C of B do 
push ( Q , A, C) 
next C 
end while 
end algorithm 


Figure 15.16: Iterative Collision Query. 


algorithm time-critical(.. . ) 

Queue Q 

O = results from broad-phase 
for all pairs of overlap (A, B) £ O do 
push (Q,A,B) 
next (A, B) 

while not Q empty and time left do 
pop (Q,A,B) 

end while 
end algorithm 


Figure 15.17: Time-Critical Collision Query. BV pairs are kept in a queue; during traversal, pairs are 
popped off the queue and tested until time is up or the queue is empty. 
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wcs 

Figure 15.18: Geometries A and B arc defined in their model spaces, which are placed in a common world 
coordinate system. 


Usually the object geometry is created off-line in another application, and the coordinate space in which 
the object was initially created is called the model space, or model frame. When the object is used in 
an animation or simulation, it is moving around in a coordinate space called the world coordinate system 
(WCS). WCS is a fixed global coordinate frame, and any motion can be described absolutely in it. The 
model frame is fixed w.r.t. to the object, meaning that if the object moves and translates in the world 
coordinate system, then so does the model frame. 

BVHs arc built from the geometry of an object, therefore it is most natural to represent them in the 
model frame. When we perform an overlap test between two BVs from different objects, we need to bring 
these into a common coordinate frame in order to perform the test. This action is called BV updating or 
simply updating. Different updates need to be considered for rigid and deformable objects. Deformable 
objects arc usually represented in WCS, and during a simulation, the world coordinate position and shape 
of the individual geometric primitives of the deformable object arc altered, causing the initial BVH to 
become misaligned. Deformable objects thus require a different kind of update involving possible re¬ 
building, rebalancing, and refitting of BVHs. This will be treated in Section 15.7. For rigid objects, three 
possibilities arc available for BV updating. Consider two objects A and B, as shown in Figure 15.18. 
We can transform the BV from the model frame of A into the model frame of B or vice versa, and these 
two kinds of transformations are referred to as a model space update. The third and last possibility is to 
transform both BVs from their model frames into the common world coordinate system, which is called a 
world space update. 

A third local coordinate system often used for rigid objects is the body frame. This is an object-fixed 
coordinate frame with the origin placed at the center of mass and the axes aligned with the principal axes 
of inertia. During simulation, it is the body frame of a rigid object which is moved, and before starting a 
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AABB again: an expensive refitting of the BV yields optimal tight tit (RIGHT), or a cheap wrapping 
yields a loose fitting (MIDDLE). 


collision query we must first determine where the model frame is placed in the world coordinate system 
before we can apply either a model space update or a world space update to the BVs. Fortunately, the 
transformation between the body frame and model frame is always the same and needs to be found only 
once at initialization time. 

For BVs aligned to a specific frame such as an axes aligned bounding box (AABB), a rotational motion 
will make the AABB nonaligned, that is, turn it into an oriented bounding box ( OBB ). Thus the BV updat¬ 
ing for AABBs not only involves coordinate transformations but also requires a realignment. Figure 15.19 
illustrates the problem together with two solutions. Refitting the BV is computationally expensive, since 
it requires us to examine all the geometric primitives covered by the BV and find the smallest AABB. The 
benefit is that the new AABB will have a tight fit, and thus provide the best pruning capability. Another 
less expensive approach is to place an AABB around the OBB, thus avoiding having to examine all the 
geometry covered by the BV. This is a fast and cheap solution, but the new BV will be loose, imply¬ 
ing a worsened pruning capability. The loose method is commonly referred to as approximate, dynamic 
updating or simply, wrapping. The difference between the two methods is illustrated in Figure 15.19. 

Some geometries arc nonaligned, such as OBBs, and nonaligned BV types do not require refitting. 
Some BV types arc even more favorable. For instance, spheres arc completely independent of their orien¬ 
tation, implying that only their positions need to be considered for the update. 

There is both a performance and a fitting aspect to the model space update, since only one BV needs 
to be updated, implying half the work compared to a world space update. Only the updated BV might 
suffer from a looser fit, in contrast to a world space update where both BVs might suffer from a looser 
fit. Due to the performance and fitting benefits, model space updating is often preferred over world space 
updating. 

Figure 15.20 contains pseudocode for a model space update, and Figure 15.21 contains pseudocode 
for a world space update. 

In every single BV comparison, at least one BV update is taking place; as we saw in the simple stick 
object example in Section 15.1, a single BV can be part of many BV comparisons. In the example, we 
first tested two root BVs, but in the second iteration only one of these was descended, resulting in several 
BV comparisons between children and a root. Thus the root BV might be wastefully updated at each 
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algorithm colliding (Aa, Bb ) 

Let Tab be transform from B to A 
Ba = Tab{Bb) 
return overlap(AA, Ba) 
end algorithm 


Figure 15.20: Model space update from model frame of B, to model frame of A. Subscript-denoted 
coordinate space representation. 


algorithm colliding (Aa, Bb ) 

Let Twcs,a be transform from A to WCS 
Let Twcs,b be transform from B to WCS 
Awes = Twcs,a(Aa ) 

Bwcs = Twcs,b{Bb) 
return overlap(Awcs, Bwcs) 
end algorithm 


Figure 15.21: World space update. Subscript-denoted coordinate space representation. 


comparison. This should be avoided by using a caching strategy, for example. 

A simple approach to update caching is to let the node in the hierarchy have two BVs associated with 
it: one native BV, which is the BV originally created in the model space, and a second BV, which is the 
updated BV together with a time stamp of the current collision query. During a comparison, one first 
examines the time stamp of the nodes. If a node does not have an up-to-date time stamp, then the second 
BV is updated by using the current transformation on the native BV. Finally, the time stamp is set to reflect 
that the BV has been updated. After having taken care of time stamp testing and possible BV updating, 
an overlap test can be performed between the two updated BVs. This strategy is simple, but there arc 
some drawbacks. First, the amount of storage needed for a BVH is almost doubled, since two BVs arc 
needed for each node in the hierarchy instead of just one. Second, it places some restrictions on world 
space updates, since two identical objects cannot share the same BVH data structure. Nevertheless, there 
is no problem for the model space update, since there is no problem in testing the native BV against the 
updated BV in a shared BVH. That is, for a world update, the second BV update will overwrite the first. 


15.3 Approximating and Hybrid BVHs 

Figure 15.22 shows a space fighter. The space fighter is a polygonal model consisting of little more than 
80,000 faces. If we were to build a traditional balanced binary BVH, where each leaf covers exactly one 
face, then we will need a binary tree with 80,000 + (80,000 — 1) = 159.999 ~ 160 K nodes. The kind 
of object shown is suitable for a gaming context, but in this context, a BVH of 160 A nodes would be 
considered gigantic and of little practical usage. 

To obtain a workable representation for a time-critical application such as a game, we will start by 
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Figure 15.22: A space lighter polygon mesh. 


approximating the object. As can be seen in the first row of Figure 15.23, we have done a volume sampling 
of the object. We have manually placed a set of primitive BV types: cylinders, spheres, and OBBs. The 
BVs arc placed such that they closely resemble the object. Observe that at some places the original object 
sticks outside the volume sampling; other places the volume sampling floats a little bit over the object. 
This is perfectly legal for an approximation. 

The BVs in the volume sampling will be used as leaf nodes in the resulting BVH. The topology of the 
resulting BVH is drawn to the left of the space tighter as it is being built. 

Next we will tty to find the parents of the leaf nodes. This is done by merging close leaf volumes, and 
then approximating them by a suitable BV type. In the next row, we see how the cylinders and OBBs of 
the three wings have been merged into three OBBs. In the next step, the cockpit paid and the three wings 
arc merged into a single root sphere. 

Counting the number of nodes in the final BVH, we see that the BVH contains a total of 16 nodes, 
yielding an improvement of a factor of 160, 000/16 ~ 10000 in size. The decrease in size is not only 
reflected in storage requirements, but also the performance of a collision detection query is drastically 
improved. 

In this example, we have chosen not to annotate the leaves of the BVH with the primitives they 
cover. This means that a collision detection query will determine penetration based solely on the leaf BV 
geometries. The kind of hierarchy shown in Figure 15.23 is termed an approximating heterogeneous BVH 
or an approximating hybrid BVH. It is called approximating because the leaves only approximate the true 
shape (surface) of the underlying object. The phrase heterogeneous or hybrid reflects the fact that different 
kinds of B V types arc used. The opposite, where only a single B V type is used, is sometime referred to as 
a homogeneous BVH. 

Approximating hybrids arc particularly effective, but it is difficult to automate the construction of 
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Figure 15.23: Example of an approximating heterogeneous BVH. BVH is built bottom up using a volume 
sampling for the leaves. 
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approximating hybrid BVHs. Such hierarchies arc quite common in computer games and often they arc 
hand-modeled as illustrated, which is a tedious, time-consuming, and error-prone approach. In our opin¬ 
ion, the most difficult paid of an automatic construction method is to obtain the initial volume sampling, 
also called an object segmentation. Two different policies can be applied for a volume sampling: fully 
conservative coverage or sample-based coverage. 

Figure 15.24 illustrates how the approximating hybrid BVH is used in a collision query. The figure 
should be read in a top-down manner. The left column shows the result of the recursive BV overlap test, 
and the right column shows how the corresponding traversal rule replaces an overlapping BV with its 
children. Observe that the traversal rules will pick the largest volume to descend. If two overlapping 
volumes arc the same size, a random volume is picked. The recursive collision query ends when two 
leaf BVs arc tested for overlap. In this illustrative example the collision query ends with having found a 
touching contact between two cylinders, one from each space fighter. 

As can be seen from Figures 15.23 and 15.24, an approximating BVH provides a progressive re¬ 
finement of an object. In a collision query, this property can be used to interrupt the collision query if 
available time is exhausted, and use the overlapping BVs at the level where the query was interrupted to 
form a contact point for the collision resolving. Usually sphere trees arc used for this [Hubbard, 1996, 
O’Sullivan et al., 1999, Dingliana et al., 2000, Bradshaw et al., 2004]. 

15.4 Performance and Complexity 

To achieve a good performance of a collision query, one seeks good pruning capabilities of the BVs in 
the BVH. This implies that a BV comparison is capable of rejecting a pair of BVs for further processing 
at the earliest possible level. Obtaining good pruning capabilities has an impact on both the topology of 
hierarchy and the fitness of BVs. There is not a single overall goal for constructing a BVH with good 
pruning capabilities, but rather it’s a tradeoff between different properties of the BVs. As a consequence 
it becomes very application dependent to pick the best way to build a BVH. 

We saw some properties of the BVs in a BVH from the simple stick object example in Section 15.1. At 
each level of the hierarchy, the primitives covered by a single BV at that level arc also covered by the union 
of that BV’s children at the next level, and the leaf nodes in the hierarchy each cover a single primitive. 
Furthermore, child BVs can extend beyond the space of their parent, and sibling BVs can overlap each 
other. 

It is important the BVs arc as tight fitting as possible to be able to reject a BV comparison at an early 
stage, simply because tighter fitting BVs cover a smaller space and thus it is less likely that other BVs will 
cover the same space. 

A similar argument holds for reducing the amount of overlap between siblings in the hierarchy. A 
large overlap between siblings indicates that they cover the same space, thus a traversal might descend 
simultaneously down two paths in the hierarchy. If there were no overlap between siblings, the traversal is 
more likely to pick only one of the paths. In practice, trying to reduce the overlap between siblings implies 
looser fitting BVs and vice versa, thus the two desired properties for obtaining good pruning capabilities 
arc conflicting. 

To reduce overlap while having a tight fit of BVs, one could adopt more complex BV shapes, such as 
those with increasing complexity: spheres arc simple, axes aligned bounding volumes (AABBs), oriented 
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Figure 15.24: Example of collision query using the approximating hybrid BVH from Figure 15.23. 
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Faster overlap tests 



Spheres 



AABBs OBBs Convex Hulls 



Figure 15.25: Tradeoff between BV geometry complexity, number of overlap tests performed in a traver¬ 
sal, and the cost of performing a single overlap test. 


bounding boxes (OBBs), and convex hulls. In general, one could choose any kind of BVs, convex or non- 
convex. We only need to know how to fit it to a subset of primitives, and how to detect overlaps between 
two BV. 

Theoretically, we could pick a very complex BV fulfilling our two requirements for good pruning 
capabilities; however, this will not resolve our problem of obtaining good performance due to tradeoffs 
between BV geometry complexity, cost of BV overlap tests, and number of overlap tests. 

Imagine having picked a complex geometry with perfect pruning capabilities, implying tightest pos¬ 
sible fit and no overlap between sibling BVs. The traversal algorithm would pick the optimal path down 
the hierarchies during a traversal such that no wrong path is examined. In such a case, the number of 
overlap tests is reduced to a bare minimum. Unfortunately, the cost of performing an overlap test be¬ 
tween two complex geometries is expected to increase with complexity, and therefore it might be better 
to have picked a simpler geometry, with a cheaper overlap test, even though we will perform a number of 
unnecessary overlap tests due to the loss in pruning capability. This is illustrated in Figure 15.25. 


Two definitions for measuring the tightness of a BV arc presented in [Zachmann et ah, 2003b]: 

Definition 15.1 (Tightness by Hausdorff Distance ) 

Let B be a BV, G some geometry covered by B. The Hausdorff distance, h(B, G), is defined to be 

h(B , G) = max min d (b, g ), 
beB g£G 


(15.1) 
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where b is a point on the surface of B and g is a point on the geometry. G, is the distance between 

two points. The Hausdorff distance measures the maximal distance of B to the nearest point in G. Let the 
diameter of the geometry G be defined as 

diam(G) = ma xd(g,f), (15.2) 

9>/eG 

then we can define a tightness measure, t, as 


h{B, G) 
diam(G) 


(15.3) 


The Hausdorff distance can be computationally extensive and numerically sensitive to compute. A tight¬ 
ness measure based on volume can therefore be used: 


Definition 15.2 (Tightness by Volume) 

Let C(B ) be the set of children of a BV B, let vol(B) be the volume of a BV B, then tightness, t, can be 
defined as 


vol(B) 

EceC(B) vol ( c ) ’ 


(15.4) 


If L{B) is the set ofleafBVs beneath B, then an alternative tightness definition can be used: 


vol(B) 


(15.5) 


The topology of the tree structure can be described by the number of children a nonleaf node has. If 
all nonleaf nodes have n children, then we have an n-ary tree. A frequently used case is n = 2, which 
is also called a binary tree. The number of children is often referred to as the cardinality or the degree. 
Given the maximum degree, d, and the number of primitives, n, the height, h, of the complete tree is at 
least 

h>\\og d (n)], (15.6) 

where equality only holds when the tree is balanced. The height of a tree is defined as the number of nodes 
encountered on the longest path from any leaf node to the root node. Furthermore, the BVH will have at 
most, 2n — 1 nodes and n of these are leaves. This follows quite easily, since a binary tree will have the 
smallest (sensible) degree, and any higher degree will lower the number of internal nodes. These bounds 
on BVH height and node count only hold when leaf nodes cover exactly one distinct primitive. 

The topology of the BVH can also play an important role for the performance of a collision query. 
Often, balanced binary trees arc wanted since these arc known to have good search properties. However, 
as our previous examples show, a hierarchy does not need to be binary. In fact, quad-trees and octrees 
have been reported to be significantly faster than binary trees [Mezger et al., 2003, Larsson et al., 2001], 
Furthermore, as far as we know, there have been no experiments showing that binary balanced BVHs arc 
always the best choice. For instance, one could argue that for approximating BVHs queries, it would be 
feasible to have the BVH balanced w.r.t. volume [Somchaipeng et al., 2004]. 
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Figure 15.26: Dumbbell, showing two ends with nearly equal volume, but with a large difference in 
number of primitives. 

As an example, imagine a dumbbell figure where one end is finally tessellated, and the other end has 
a coarse tessellation as shown in Figure 15.26. From a volume-approximating viewpoint, both ends arc 
equal, since they have nearly the same amount of volume inside them. From a surface point of view, one 
end is much heavier than the other and requires a finer subdivision. Thus, creating a balanced hierarchy 
w.r.t. volume and number of primitives will be focused quite differently. 

For surface-based BVHs, OBBs arc superior over spheres and AABBs [Gottschalk, 2000]. AABBs 
[Bergen, 1997, Larsson et al., 2001] arc the preferred choice for deformable objects. The reason is that 
during simulation, the BVs in the BVH constantly need to be updated, and the cost of refitting an OBB is 
very expensive compared with the cost of refitting an AABB. Hence, AABBs can outperform OBBs, even 
though OBBs have better pruning capabilities. For volume-based methods, spheres arc often the preferred 
choice [Hubbard, 1996, O’Sullivan et al., 1999, Bradshaw et al., 2004]. 

In general, one tries to keep the overall size of the BVH as small as possible; with a small height these 
two properties arc known to improve the speed of a search through the tree structure. Obviously for a 
surface-based BVH, the number of primitives have a direct impact on these two desired properties, since 
increasing the number of primitives will make the BVH large and increase the height as well. 

Let’s su mm arize the characteristic of a good BVH: 

• small size 

• small height 

• good pruning capabilities; no overlap between BVs at the same level and smallest fitting BVs 

• balanced hierarchy—in terms of data structure and in terms of volume 

Frequently, the time required to perform a collision query is approximated by the linear combination 

T = N V T V + NpTp (15.7) 
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Here, N v and N p denote the number of overlap tests for bounding volumes and primitives respectively, 
and T v and T p denote the corresponding average times to perform these tests. 

Generally speaking, one seeks to make N v and N p as small as possible by picking the bounding 
volume with the best pruning capability for the problem at hand. Also T„ and T p should be as small as 
possible by choosing simpler geometries. Unfortunately, these two minimization goals are conflicting as 
explained earlier. 

The particular - query in question also has an impact on the running time. If a close proximity query is 
performed, for example, a cloth falling over an object, then N p is likely to increase rapidly. The primitive 
testing is in many cases more expensive than the simpler bounding volume overlap testing. 

A third term should be included in the time cost, such that the BV updating is taken into account. This 
was suggested in [Klosowski et al., 1998] yielding 

T = N V T V + N p Tp + N U T U (15.8) 

This update term covers transforming BVs into the same coordinate frame before performing an overlap 
test. This transform might include a dynamic update of the BV geometry itself. For instance, if an 
AABB BVH is built and kept in the model space of a rigid object, then the model frame will have a 
different orientation than the world frame during rotational motion, indicating that the AABBs are no 
longer AABBs, but have become OBBs. 

One resolution to the problem would be to update the OBB into an AABB, such that an AABB overlap 
test can be performed. The drawback is that the updated AABB could have a much looser fit than the OBB. 
This was discussed in detail earlier; we refer the reader to Figure 15.19. Another approach would be to 
perform a more expensive overlap test, capable of dealing with the OBB instead of the AABB. 

For deformable objects, the updating cost is a little different. It is actually more of a refitting cost, 
implying that the average time T u might not be a constant. As an example, imagine a simple update 
strategy as follows: before a query, the two BVHs are updated in a bottom-up fashion, leafs are refitted 
to cover their annotated primitive, internal nodes are refitted to tightly cover the primitives given by the 
union of the primitives covered by their children, and the root is refitted to cover all primitives. Obviously, 
the leaf refitting is very cheap, and the closer an internal node is to the root, the more expensive the update 
will be. Deformable objects will be treated more thoroughly in Section 15.7. 

Lower and upper bounds on a balanced binary BVH consisting of ideal BVs have been proven 
[Gottschalk, 2000]. An ideal BV is defined as one with perfect pruning capability, that is, it is so tight 
fitting that it only overlaps with other BVs if the covered geometry overlaps. Suppose we have two mod¬ 
els each consisting of n primitives. Assume that the BVHs are perfectly balanced using ideal BVs. If the 
models are touching and there are k > 0 contacts, then the number of BV tests required to identify all k 
contacts is at least 

vi{n, k) = 21og 2 (n 2 //c) + 2k — 1, (15.9) 

and at most 

u u {n, k) = 2k \og 2 (n 2 /k) + 2k — 1. (15.10) 

Obviously the lower bound on the number of ideal BV tests is also a lower bound of real BV tests. The 
upper bound represents the worst-case performance for ideal BVs, and therefore it represents a lower 
bound on the worst-case performance for ordinary BVs. 
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Figure 15.27: The performance impact of primitive count, proximity type, and shape complexity on a 
collision query. 


Of course, in the real world there is no such thing as an ideal BV, and we arc left with real BVs. A 
worst-case time complexity for real BVs is difficult to derive, if possible at all, due to the many factors 
involved—the complexity of the object shape, the number of primitives in the object, the BV geometry, 
the topology of the BVH, etc. 

Of course, it is possible to construct situations where the worst imaginable time complexity will be 
0(n 2 ); for instance, if one were to create and place two objects, such that all their primitives were in¬ 
tersecting each other. However, in physics-based animation we often work with objects that resemble 
real-world objects, implying that the geometry for solids is a connected nonintersecting twofold, and at 
least in their rest shape they arc nonintersecting with themselves. A 0(n 2 ) time complexity is therefore 
not likely to occur. Figure 15.27 illustrates our discussions on the time complexity of a collision query. 


15.5 Hierarchy Construction Methods 

There arc many approaches for building BVHs. They can be classified into three main classes: top-down, 
bottom-up, and incremental methods. Their names reflect how the BVH is traversed during creation of the 
nodes in the hierarchy. In Section 15.1 and in Section 15.3 we described two different ways for obtaining 
the example BVHs: the stick object applied a top-down method, whereas the star fighter model used a 
bottom-up approach. We have not shown any examples of an incremental method, but here, the general 
idea is to extend a BVH incrementally. Incremental methods arc, to our knowledge, not widely used in 
collision detection. Each of the three main groups can be further subdivided into subgroups. 

Top-down methods can be grouped as being based on a subdivision scheme or a statistical approach. 
Both of these subgroups arc treated in detail in Section 15.5.1. 

Bottom-up methods come in two flavors, depending on whether a volumebased BVH or a surface 
BVH is wanted. Volume-based BVHs usually start out by performing some sort of volume sampling of 
an object. Medial surfaces arc very popular for volume sampling. Alternatively, diffusion-based methods, 
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such as using scale space and deep structure may be used [Somchaipeng et al., 2004]. Alternatives arc 
convex decompositions or simply voxelizations; even subdivisions by octrees have been used to obtain a 
volume sampling of an object. Surface-based BVHs tend to be based on a merging strategy. Here, BVs arc 
initially formed for the leaves of the BVH and various strategies can then be applied to pick BVs, which 
undergo a merging operation. Theoretically, it is an all-pair problem for picking two BVs, and even worse 
if a higher degree of the BVH is wanted. The problem could be cast into a optimization problem. Graph 
theory is also a popular choice, where nodes correspond to BVs and edges to possible merges. Greedy 
algorithms for picking edge-candidates for merges arc often adopted. 

Incremental methods arc governed by two main problems: finding a place to insert a BV and then 
updating the BVH [Zachmann et al., 2003b]. 

The various methods arc summarized here: 

Top-down methods 

• Statistical approach 

• Subdivision 

Bottom-up methods 

• Volume sampling 

- Structural information (medial surfaces) 

- Diffusion 

- Convex decomposition, voxelization 

• Merging 

- Optimization problems 

- Graph theory 

- Greedy (deterministic) algorithms 

Incremental methods 

• Inserting a BV, updating hierarchy 

In the following subsections we will treat both top-down and bottom-up methods in more detail; incre¬ 
mental methods arc only briefly touched. 

15.5.1 Top-Down Methods 

A top-down method usually adopts a. fit-and-split strategy, where for a given collection of primitives, a 
BV is fitted to cover all the primitives, and then the collection is partitioned into two or more groups. For 
each group a new BV is fitted and the collection is partitioned again. This is repeated until each group 
only contains a single primitive. Figure 15.28 illustrates the basic idea in pseudocode. Two operations 
need to be considered in order to apply a top-down method: 
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algorithm f it-and-split (parent,primitives) 
bv = BV(primitives) 
add bv as child to parent 
groups = split(primitives) 
for each g in groups do 
fit-and-split(bv,g) 
next g 

end algorithm 


Figure 15.28: Fit-and-split strategy of a top-down method. 


Fitting Given a collection of primitives and a BV type, how do we find the smallest and tightest fitting 
BV? 

Splitting Given a collection of primitives and the degree, d, how do we split the collection into d groups? 

The task of fitting a BV to a collection of primitives is deferred to Section 15.6. In this section we will 
only consider the task of splitting a collection of primitives into two or more groups. 

Top-down methods arc easily implemented. They arc probably also the most common used in practice. 
Most splitting methods seek a binary partitioning of the primitives resulting in binary BVHs. It is easier 
to split a set of primitives into two disjunct groups than into three or four groups simply because given a 
single primitive, one only needs to consider a single yes-no decision. It is also desirable to make the BVH 
balanced, to give it a good overall worst-case seeking complexity. Having balanced binary BVHs yields 
an 0( log 2 n) depth (height) for n primitives. 

Most splitting methods are based on the idea of first choosing a splitting axis to indicate a direction 
of splitting. Hereafter, a splitting point on the axis is often determined and a splitting or dividing plane 
can be set up, having the axis as normal and containing the splitting point. Having obtained a splitting 
plane , one is left with the task of deciding which side of the plane a primitive is mapped to. Most often 
the decision is based on the position of the centroid of the primitive w.r.t. the splitting plane. Figure 15.29 
contains 2D illustrations of the concepts and terms we have introduced. 

Without clever search structures the partitioning of k primitives takes O(k) time. If the fitting process 
is also O(k) for k primitives, then the BVH building process has the same recurrence relation as quicksort, 
that is, expected 0(n log n). If the BVH is reasonably balanced however, the worst-case running time is 
0(n 2 ). If the fitting is not linear in the number of primitives, but for instance 0(n log n), which is the 
case when a convex hull computation is involved, then the BVH building takes 0(n log 2 n). 

15.5.1.1 A Subdivision Method 

Using a subdivision method for finding the splitting plane is often fast and simple. For instance, in 
[Klosowski, 1998] a splitting plane orthogonal to the x-, y-, or z-axis is picked based upon the following 
criteria: 

Min Sum: Choose the axis that minimizes the sum of the volumes of the two resulting children. 
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Figure 15.29: Often-used terms and concepts in top-down methods. 


Min Max: Choose the axis that minimizes the larger of the volumes of the two resulting children. 
Longest Side: Choose the axis along which the BV is longest. 

The last method does not make much sense if spheres are used, but works great for AABBs, fc-DOPs, 
OBBs, or similar BV types. The idea in min sum and min max is to try out all axes, and pick an axis 
according to some greedy optimization strategy. They both require that six BVs must be computed, one 
pair for each axis, involving six fitting operations. 

An alternative algorithm not restricted to using a splitting plane orthogonal to the coordinates axis 
works by taking as input a set F of BVs each enclosing one primitive and a set C of points, which arc the 
barycenters of the primitives. The algorithm starts by finding c t , Cj 6 C with almost maximal distance. In 
[Zachmann, 1998], a nearoptimal pair is found by a simple 0{n) heuristic. Then a splitting axis parallel 
to cjcj is determined and C is sorted along that axis, which induces a sorting on F. After these initial 
steps, which arc done for every recursion, F can be split in two parts F\ and F>. To start with F\ = f, t , 
F -2 = fj, where f,. fj arc associated to c t , cj, respectively, then, all other f € F are treated in an iterative 
manner one by one and assigned to either F\ or F 2 , according to whichever BV increases the least. If both 
BVs of Fi and F 2 increase equally and in particular, if they do not increase at all, then / is added to the 
set, which has the least polygons. 

According to [Zachmann, 1998], the algorithm produces good trees that arc fairly well balanced. Fur¬ 
thermore, the algorithm is geometrically robust and can be applied to all unstructured models, that is, no 
adjacency information is needed or connectivity restrictions required. A very strong point of the algorithm 
is that it is capable of dealing with degenerate primitives, such as polygons collapsing to lines, points, or 
worse. 

15.5.1.2 A Statistical Method 

In order to increase the pruning capabilities of the BVH, we seek to make BVs fat, that is, we would like 
to avoid having long thin BVs. These span a large area of space, thus increasing the chance of overlapping 
a lot of other BVs. If the BV is more like a cubic or spherical shape, it will in a sense, be more local. 
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Figure 15.30: 2D example of a bad split and a good split. 


implying a better chance for pruning. Therefore, splitting along the axis of maximum spread increases the 
chance of creating more fat BVs than long thin BVs. Figure 15.30 illustrates the idea in a 2D Example 
where a rocket object is shown, together with a bad and good split. In the following, we will consider the 
theory for spatial point distributions and derive the covariance matrix. Afterward, we will give some hints 
on how to generalize the results. 

Consider Figure 15.31. We have drawn a set of points in 2D and assume that we want to split along 
an axis with maximum variance. In general, we have a point set with N Points: 

p = {Pi,P2,--Pn} • (15.11) 

Usually the points belong to 3D space, that is, 

G R 3 , for i = 1,..., N. (15.12) 

The mean is given by 

rn =JjY! p i- (15.13) 

i 

The direction of the maximum variance is denoted by the vector n. As can be seen from Figure 15.31, it 
is quite easy for a human being to intuitively pick n. If we subtract the mean from each p, we get a new 
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point set: 


where 


Figure 15.31: Illustration of what is meant by the axis of maximum spread. 


Q = • • -,Qn} , 


q l =p l -m, for i = 1,... ,N. 


(15.14) 


(15.15) 


The direction, n, of maximal variance is also the direction that maximizes the square sum of the projection 
of Q onto n, 

n = ^^(n-gj 2 . (15.16) 

i 

We want to maximize v w.r.t. n under the condition that enormn = 1. This is equivalent to maximizing 
the expression 

v + A (n • n — 1), (15.17) 

where A is a Lagrange multiplier. The maximum must occur at zero derivative, so we differentiate with 
respect to n x , n y and n z 


(^ E ( n -^) 2 + A ( n ' n - 1 )) = 0 ’ 


d / 1 


dn y \ N 


( n • 9*) 2 + A (n • n - 1) ] = 0, 


i(^E^-^) 2 + A (n-n-l)] =0. 


(15.18a) 


(15.18b) 


(15.18c) 
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For the derivative w.r.t. n x we get 


(4Z>-9i) 2 + A (n-n-i)) =o> 


4 


iy ^ ' , (iT'xQix 4“ IT'yQiy 4“ ClzQiz) 4“ A (^rr 4"" ^y 4"~ l) — 0, 


4 


iV 


^ ' 2 ( n x Qix 4~ n-yQiy 4~ 'ft'zQ.iz) Qix 4~ N2n x — 0, 


4 


^ ' {‘IT’xQix 4“ nyQiy 4“ TT’zQiz) Qix 4“ — 0. 


(15.19a) 

(15.19b) 

(15.19c) 

(15.19d) 


Obviously we get similar equations for the derivative w.r.t. n y and n z , so we end up with 


jy ^ ^ (TlxQix H - TlyQiy W'zQiz) Qix x — 0? 

i 

(15.20a) 

-jy ^ ^ {j^xQix W'yQiy ^ zQiz ) Qiy H - y ~ 0 ? 

i 

(15.20b) 

-jy ^ ^ ( W'xQix TlyQiy ^ zQiz ) Qiz ^^lz = 0. 

(15.20c) 


i 


We can rewrite this into a matrix equation 


r J-V o 2 

S- n Vix 

TV^Si Qiy Qix 

N Si QizQix 


n x 


n x 

N Si QixQiy 

N 5S liy 

TV Si QizQiy 


n y 

= -A 

n y 

Tv Si Qix Qiz 

Tv Si Qiy Qiz 

1 

w 


_ n z _ 


_ n z _ 


■v" 

c 


(15.21) 


That is, 

Cn = —An (15.22) 

The matrix C is symmetric and is called the covariance matrix. Furthermore, this is an eigenvalue problem 
with n and A as eigenvector and eigenvalue of C. Hence the eigenvector with maximum eigenvalue 
is the splitting axis. Splitting can now occur in two ways: 

• Divide the primitives into two disjunct sets, by using a splitting plane at the mean. 


• Project the centroids of the primitives onto the splitting axis and find the median. Then use a 
splitting plane running through the median. 
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algorithm bottom-up (L) 

C = L 

while \C\ > 1 do 
set P empty 
while C not empty do 
choose subset S from C 
B = BV(S) 

add S as children to B 
add B to P 
end 
C = P 
end 

root = C 
end algorithm 


Figure 15.32: Pseudocode illustration of a general bottom-up method. Taking a set, L, of leaf BVs as 
argument, the set C denotes the children BVs and P the new parent BV. 


Picking the median over the mean ensures a balanced BVH, however, using the median over the mean 
yields no improvement on the running times of a collision query [Klosowski et al., 1998]. Numerical 
experiments indicate the mean outperformed the median. 

Although covariance clearly shows how to obtain a sensible splitting axis, it is by no mean trivial to 
do so. There are many pitfalls involved in computing a good covariance matrix for a set of primitives. We 
deal with this in full detail in Section 15.6.1. 

The splatter method [Klosowski, 1998] is slightly different. It consists of first projecting the centroids 
of the primitives onto each of the three coordinate axes and then calculating the variance of each of the 
resulting distributions. Hereafter, one chooses the axis yielding the largest variance. 

15.5.2 Bottom-Up Methods 

In bottom-up methods the idea is to start by fitting BVs for the leaves of the BVH and then merging 
the leaves of subsequential nodes until the root is reached. A new BV must be created after each merge 
covering the entire subtree. 

Given n primitives, then n— 1 merges are required in order to build the BVH. Determining an enclosing 
BV after a merge can be done in 0(1) time, if one simply lets the BV enclose the geometry of the 
immediate children. For k primitives in the subtree, a fitting, which examines all the covered primitives 
one by one will take 0(k). If extremal points or a similar measure is sought, then the complexity is 
0(k\ogk). The total BVH construction time would be 0(n), 0(n log n), and 0(n log 2 n) using these 
fitting strategies respectively. Of course, here we have implicitly assumed that it takes 0(1) time to decide 
which nodes should be merged into a new parent node. 

A pseudocode illustration of the basic idea of a bottom-up method is shown in Figure 15.32. There 
are two main difficulties in a bottom-up method: 

• picking a subset of BVs at level l, which should form a parenting BV at level l — 1 
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Figure 15.33: Basic data structures used by the bottom-up mesh connectivity method. 


• computing the BV at level l — 1 

In the following, we will treat the first problem and defer the second to Section 15.6. 

15.5.2.1 Mesh Connectivity 

Mesh connectivity is the dominant strategy for applying BVHs to deformable objects [Volino et al., 1998, 
Volino et al., 2000, Larsson et al., 2001, Bridson et al., 2002, Bergen, 1997]. [Larsson et al., 2001] reports 
a performance increase of up to 10% for the collision query when using mesh connectivity. Deformable 
objects will be treated further in Section 15.7. Here, we will describe a bottom-up construction method 
from [OpenTissue, 2005], inspired by [Hubbard, 1996, Larsson et al., 2001, Bridson et al., 2002]. 

Given a mesh, the topology can be used to guide the bottom-up construction. Initially, leaf BVs are 
created, such that they cover exactly one face in the mesh. At the same time, a graph data structure is being 
built to represent the dual graph of the mesh. The dual graph is constructed as follows: a node is created 
in the graph for every face, and edges arc inserted between nodes in the graph if their corresponding faces 
arc neighbors in the mesh as illustrated in Figure 15.33. The graph edges describe potential merges of 
leaf nodes into parent nodes. During initialization, all edges arc assigned a priority penalizing a potential 
merge between the BVs stored in the end nodes. All the edges arc then inserted into a priority heap, 
allowing the algorithm to incrementally pick the edge with minimum priority. Thus the edge, which 
represents the two BVs that arc most beneficial, arc merged into a parent BV. Whenever a merge operation 
is performed, the corresponding graph edge is collapsed, thereby melting the two end nodes of the edge 
into a single node. The main loop of this algorithm is shown in pseudocode in Figure 15.34. When an 
edge collapse occurs, one has to guard against inconsistencies in the graph data structure. For instance, if 
multiple edges exist between the two merging end nodes, self-loops will exist in the resulting graph, and 
these must be removed during the edge collapse since it makes no sense to merge a node with the node 
itself. There is a direct correspondence between a node in the graph and a BV node. Therefore, we keep 
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algorithm construct (G) 
Heap Q 

for all edgese£ G do 
e.priority = cost(e) 
add e to Q 
next e 

while Q not empty do 
e = pop(Q) 
collapse (e,G, Q) 
sort Q 
end 

end algorithm 


Figure 15.34: Main loop. 


a pointer to the current BV, which the node represents. Furthermore, we keep track of the height this BV 
has in the BVH that is being built. 

Due to BV fitting, it is beneficial to keep track of the mesh geometry that is covered by a graph node, 
thus every time we collapse an edge in the graph, the resulting node will contain the union of the geometry 
covered by the two end nodes before the edge collapses. 

After a merging operation, some edges in the graph will have updated one of their end nodes to the 
merged node, thus they need to get their priority updated. The heap must also be updated to reflect changes 
in priorities. 

Figure 15.35 contains the details of an edge collapse operation in pseudocode. The cost function, used 
to assign priorities to the graph edges, can be used to control the overall topology of the BVH. A useful 
practical cost function consists of the sum of the heights of the BVs stored in the end nodes. This cost will 
try to favor a balanced BVH. 

Other criteria could be the volume that a resulting BV would have if a merge was carried out, thereby 
favoring smallest possible BVs. However, there is no telling if a greedy pick of smallest BV would yield 
a final BVH where all nodes are as small as possible. 

A real-life example of a BVH built with the mesh connectivity method is shown in Figure 15.36. The 
bottom-up method we have outlined is not specific for mesh connectivity, but rather a general graph-based 
algorithm driven by a greedy deterministic optimization algorithm. This implies that the graph does not 
need to be built from the dual graph of a mesh; instead, it could be the dual graph of the generalized 
Voronoi diagram of a volume sampling, which could be used to build approximating hybrid BVHs as 
shown in Figure 15.23. 

15.5.3 Incremental Methods 

Incremental methods are based on an insertion principle. Initially the BVH is empty, and BVs from 
an elementary set are inserted one by one into the hierarchy [Zachmann et al., 2003b]. Most incremental 
methods have complexity 0(n log n) [Zachmann et al., 2003b]. The elementary set of BVs can be thought 
of as the set of leaf BVs in the resulting BVH. The elementary set of BVs could consist of a set of BVs, 
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algorithm collapse(e, G, Q) 

[5, D\ =getEndNodes (e) 
add D.geo to S.geo 
B = BV (S.geo) 

add S.bv and D.bv as children to B 
S.bv = B 

S.height = min (S.heigh.t, H.height) + 1 
for all edges h between S and b do 
remove h from G 
remove h from Q 
next h 

for all edges h incidient to D do 
move h from D to S 
next h 

delete D from G 

for all edges h incident to S do 
h.priority = cost(h) 
next h 

end algorithm 


Figure 15.35: Edge collapse operation. 



Figure 15.36: Illustration of AABB BVH built using mesh connectivity, left image shows original polyg¬ 
onal mesh, middle image shows mesh with leaf AABBs superimposed, right image shows all AABBS at 
height four in the BVH. 
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algorithm incremental (B) 
while B not empty do 
choose b £ B 
if BVH empty then 
root = b 
continue 
else 

v = root 

while v leaf do 

choose child v' of v 
v = ,/ 
end while 
end if 

insert b into v 
end while 
end algorithm 


Figure 15.37: A typical incremental construction method taking an elementary set, B, of BVs as argument. 


where each BV in the set encloses a uniquely determined primitive. In other words, for a polygonal soup, 
each polygon is enclosed by a single BV, and all these BVs form the elementary set. The elementary set 
could also be built from an object segmentation, i.e., a volume sampling of an object can be a set of known 
BV geometry types. 

The general idea is to traverse the BVH built so far in order to find a node, which dictates an insertion 
point for the next elementary BV. We’ll label the insertion point node u and the elementary node that 
should be inserted, 6. A pseudocode illustration is presented in Figure 15.37. When a suitable v is found, 
an insertion will take place. In a typical insertion, v will be the parent of b. If u was a leaf prior to 
insertion, an extra sibling node must be created, which will contain the elementary BV originally stored 
in v. 

Having inserted a new BV in the BVH, the BV properties of all the ancestors might have been de¬ 
stroyed. In order to correct this one could traverse the insertion path backward from the newly inserted 
BV to the root, and while doing so update every BV node on the path to cover its children. 

Many variations over this scheme exist, however they all must deal with three basic problems: 

• picking the next b from B, such that a good BVH will be built 

• choosing the best child v' during the traversal of the insertion path 

• applying a good strategy to update the BVs on the insertion path after the insertion has taken place 

An optimization strategy can be applied to make these decisions by attributing 6s with a cost function and 
nodes v in the BVH with another cost function, and then making choices which minimize the cost. 
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15.6 Smallest Enclosing Volume 

Many types of BVs have been presented in the literature. The three most simple and widespread arc 
spheres, AABBs, and OBBs. In the following, we will describe fitting methods for these. We refer the 
interested reader to references in Section 15.9 for more details on other types of BVs. 

15.6.1 Fitting an OBB 

A frequently used representation for an OBB is a center position, c, and an orientation, R , given as 
a rotation matrix. Each column in R is a unit vector along a major axis of the OBB, and three half 
length extents along the three axes of the OBB. Thus, in a precomputation step the half length extents arc 
computed. Let’s for a moment assume that the orientation of the OBB is given as 

R = [vi v -2 v 3 ] • (15.23) 


For the vertices, p, with i 6 [l../c], of the set of primitives, we could compute the upper and lower extremes 
along each axis of the sought OBB by 


U 1 = max(t>i ■p 1 ,...,Vi-p k ), 

(15.24a) 

u 2 = max (v 2 ■ Pi, ■ ■ ■ , v 2 ■ p k ), 

(15.24b) 

u 3 = max (v 3 -p ll ...,v 3 -p k ), 

(15.24c) 

l 1 = min (v\ ■p l ,...,v l ■ p k ) , 

(15.24d) 

l 2 = min (v 2 • Pi,... ,v 2 • p k ) , 

(15.24e) 

l 3 =mm(v 3 - Pl ,...,v 3 -p k ). 

(15.24f) 

The width w, height h, and depth d of the OBB would be given by 

w = u l — l 1 , 

(15.25a) 

h = u 2 — l 2 , 

(15.25b) 

d = u 3 — l 3 . 

(15.25c) 


The half length extents are therefore given by (w/ 2, h/ 2, d/2), and finally the center of the OBB by 

c = i (l 1 + u 1 ) v 1 + i ( l 2 + u 2 ) v 2 + i ( l 3 + n 3 ) r, 3 . (15.26) 

In conclusion, knowing the orientation of the OBB, it is quite easy to compute its center and extents. 
Unfortunately, that was under the assumption of known orientation of the OBB. 

Generally speaking, if the set of primitives has a long and thin shape, so should the tightest enclosing 
OBB fitting the set of primitives. Covariance as discussed in Section 15.5.1.2 provides us with a powerful 
tool for finding an enclosing volume. For instance, we could use the axis of maximum variance to set 
up an orientation for an OBB. Figure 15.38 shows a 2D example where extremal points along the axes 



“book” — 2005/9/30 — 15:44 — page 484 — #496 




Figure 15.38: Using maximum variance to find the orientation of OBB. 



(a) No misalignment (b) Misalignment caused by internal nodes 

Figure 15.39: Influence of interior vertices causing a significant misalignment of the enclosing OBB. 
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Figure 15.40: Using points lying on the convex hull only to avoid misalignment caused by interior points. 


of maximum variance are used to compute an OBB. However, this headless approach suffers from some 
difficulties arising form the distribution of the points. For instance, the influence of interior vertices can 
cause an arbitrary misalignment of the enclosing OBB, as illustrated in Figure 15.39(b). To get around 
this, we could try to eliminate the contribution of interior points to the covariance. For instance, we could 
compute the convex hull of the set of points and then only use the points lying on the surface of the convex 
hull, thus ignoring all points lying inside the convex hull, as illustrated in Figure 15.40. It appeal's from 
Figure 15.40 that we have solved all our troubles, but misalignment could still in fact occur. If there is a 
nonuniform distribution of points on the convex hull, it could cause an arbitrary bad misalignment, as can 
be seen from Figure 15.41. The problem could be seen as a sampling artifact, and one way to remedy it 
would be to resample the surface of the convex hull with points to obtain a uniform distribution. However, 
this is a costly operation to perform and by no means as trivial as it may sound. A far better solution is to 
take the limiting case where we resampled the surface of the convex hull with infinitely many points, such 
that they would cover the entire surface in a uniform manner. This actually corresponds to integrating over 
the surface of the convex hull as shown in Figure 15.41(b). 

Although this method is general, some point sets can still cause misalignment due to special symmetry, 
such as is illustrated in Figure 15.42. There are currently no known methods to get around this. The 
covai'iance-based method we have outlined does not yield optimal OBBs, and they can easily be shown to 
be suboptimal. In Figure 15.43 the nonoptimality of the covai'iance-based alignment method is illustrated. 
In Figure 15.43(a), an initial object with its covariance-aligned OBB is shown. In Figure 15.43(b) a 
slightly altered object is shown superimposed on the original object. The object has been altered so that 
its convex hull still lies inside the original OBB, shown with dotted lines. The alteration is shown grayed. 
The change in shape causes the covariance alignment to pick another orientation of the OBB, shown with 
solid lines. The change in orientation causes a slight change in the extents of the OBB. Because the altered 
OBB covers the altered object, and the altered object covers the initial object, then the altered OBB also 
covers the initial object. By construction, the initial OBB covers the altered object. In essence we have 
two distinct OBBs both covering two different objects. 
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(a) Misalignment caused by nonuniform point 
distribution. 


(b) Integrate covariance over surface of convex 
hull to avoid misalignment. 


Figure 15.41: Nonuniform distribution of points on the convex hull causing misalignment of OBB. 



Figure 15.42: Special geometry causing bad misalignment; square has same statistical spread in all direc¬ 
tions, any may be a chosen orientation. The figure illustrates the worst-case fit. 


Now let us tty to derive formulas for integrating the covariance matrix over the surface of the convex 
hull. Let * denote a random point on the surface of the convex hull, then the entries of the covariance 
matrix 



Coo 

Cm 

C 02 

c = 

Cm 

C n 

Cl 2 


C*20 

C 2 1 

C 22 


(15.27) 


is by definition given as 


Cij = E [xiXj\ - E [xi\ E [xj] Vi,j G [0..2] , 


(15.28) 


where E\-\ is the statistical expectation operator, and x,, and Xj is the Fth and j'th coordinate of the 
random point. This is the statistical definition of covariance in comparison to the geometrical definition 
in (15.21). If we let S denote the surface of the convex hull and dA a differential area element, then the 





(a) Initial object with its (b) Slightly altered object inside 

covariance-aligned OBB. initial OBB, however, 

covariance-aligned OBB is 
different. 

Figure 15.43: Covariance-aligned OBBs are suboptimal. 
expectations above is by definition given as 

r , XidA 

E[Xi] = f dA , (15 ‘ 29a) 

r , f Q XjdA 

E[ Xj ] = Js 3 dA , (15.29b) 

r , XiXjdA 

E[x iXj \ = S j dA • (15.29c) 

Without loss of generality we will assume that the surface of the convex hull consists of a set of triangles, 
T k for /:: = 1,..., K . If this is not the case, these triangles can easily be obtained by standard triangular- 
ization methods [Berg et al., 1997, O'Rourke, 1998]. The three vertices of the A-'tli triangle arc given by 
p k , q k , and r k . A parameterization of the A-'tli triangle is written as 

x(s , t) = p k + u k s + v k t , where s G [0,1] , t G [0,1 — s] , (15.30) 

and the area of the A:'th triangle, A k , is written as 

A k = - u k x v k , (15.31) 

2 2 

where u k = q k — p k and v k = r k — p k . The total integrals over the surface S can be viewed as the sum 
of integrals over each of the triangles. In particular, the denominator in the above equations is 



(15.32) 
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Now the main idea is to change the integration variable to s and t, such that we can use our parameter¬ 
ization to evaluate the expectation integrals in (15.29a), (15.29b), and (15.29c). The change of variable, 
when integrating any function f(s, t ) over the triangle, is given by 


r rl rl—s 

/ fdA= u k x v k / / f(s,t)dtds. 

IT k 2 Jo Jo 


Applying this strategy to the denominator in (15.29a), (15.29b), and (15.29c) yields 

/ dA = Y f dA 

Js , JT k 


(15.33) 


dA 


(15.34a) 

? h 

' X V 

rl rl—s 

/ / dtds 

2 Jo Jo 

(15.34b) 

? k 

X V 

j (1 — s) ds 

(15.34c) 


2 Jo 


c X v k 

1 

2 2 

(15.34d) 

= A. 


(15.34e) 


The area of the convex hull is thus the sum of all triangle areas. Applying the same recipe to the enumerator 
in (15.29a) yields 


E [xi] = — / XidA 
A Js 


jZ J T ^ dA - 


and changing the integration variable and substituting parameterization give 
/ x k dA=\\u k xv k II [ { x k dtds 


= u k X v k 


'2 JO JO 
. rl rl—s 


'2 Jo JO x 
rl /» 1 —s 


(p k + u k s + v k t j dtds 


= ( u k x v k / / p k + u k s + v k tdtds . 

V 2 Jo Jo J i 

Notice that subscript refers to the Tth coordinate. Trivially solving the integrals yields 

J x k dA=(ju k xv k \^ p k + q k + r k . 


(15.35a) 

(15.35b) 


(15.36a) 

(15.36b) 

(15.36c) 

(15.37) 
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Rewriting, we get 



(15.38a) 

(15.38b) 


where rn k = ^ [p k + r k + q l ) is mean of the k’th triangle. Back substitution leads to the following 
solution for the first expectation 

E[x i ] = ^ j A k m i . (15.39) 

k 

It is trivial to see that by the same derivation, the solution to the second expectation is given as 

E [ Xj ] = A k m r (15.40) 

k 

The exact same approach can now be applied to the third and last expectation integral, that is, 


E \xjX j ] 


\ / XiXjdA 

A Js 

- y 

A ^ 

k ' 


'jrk 


x k x k dA. 


(15.41a) 

(15.41b) 


Changing integration variables and substituting the triangle parameterization leads to the following results 
with a bit of mathematical handiwork: 


E [x^x^] — 



(15.42) 


The derivation is long and quite trivial so we refer the interested reader to [Gottschalk, 2000] for a detailed 
proof. By back substituting into the definition of covariance we obtain the desired result 

Cij = E [xjXj] — E [x.j] E [: Xj ] (15.43a) 

= 2 + PiPj + + r k r k ^j - mfrrij , (15.43b) 

k 


where m s is the mean point of the entire surface. Observe that the covariance matrix is symmetrical, 
meaning that only six entries need to be evaluated. 


15.6.2 Fitting an AABB 

The axes aligned bounding box (AABB) was treated in detail in Chapter 12, and may be considered a 
special case of an OBB, where the orientation is given as 

R=[vi v 2 x 3 ] ,andxi = [l,0,0] r ,x 2 = [0, l,0] r ,x 3 = [0,0,1] T . (15.44) 
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Therefore, we can compute the minimum and maximum points of an AABB by 


u 1 = max (vi • p 1: ..., v\ ■ p k ), (15.45a) 

u 2 = max (v 2 • Pi, ■ ■ ■ ,V 2 • p k ), (15.45b) 

u 3 = max(v 3 ■ p L ,... ,v 3 ■ p k ), (15.45c) 

1 1 = min(t?i ■p 1 ,...,v 1 -p k ), (15.45d) 

1 2 = min (v 2 ■p l ,...,v 2 -p k ), (15.45e) 

1 3 = min (v 3 ■ p 3 ,... ,v 3 ■ p k ) , (15.45f) 

and then we have 

and r max =[u 1 ,u 2 ,u 3 ] 1 . (15.46) 

Notice that there is no need to save the orientation, since it is given implicitly. 


15.6.3 Fitting a Sphere 

We’ll first consider some simple cases where the set of primitives we want to enclose consists of a set of 
one point, two points, three points, and four points. We label these points as vq, v\, V 2 , and v 3 . Also, we 
assume that all points arc in general position, that is, no four points lie in the same plane, no three points 
lie on the same line, and no two points have equal coordinates. 

In case we only have a single point, we can trivially find an enclosing sphere by setting the center, c, 
equal to the point and the radius, r, to zero: 

c = vo, (15.47a) 

r = 0. (15.47b) 

In case we have two points, they must both lie on the surface of the smallest enclosing sphere, meaning 
that they arc equidistant from the center of the enclosing sphere 

c = \{v i-w 0 )+«o, (15.48a) 

r = \ ||vi - t? 0 || 2 . (15.48b) 

In the case where we have more than two points, and if all points do not lie on the surface of the enclosing 
sphere, then we can reduce the problem to the trivial case above. 

So without further fuss, we assume that all points lie on the surface of the smallest enclosing sphere, 
implying that they arc all equidistant from the center of the sphere. Let n denote the number of points, 
either three or four, then we have 

||c — Vi\\ 2 = r 0 < i < n. (15.49) 

If we take the square and rewrite the above constraints, we have 

(c - Vi) ■ (c - Vi) = r 2 , (15.50a) 

c ■ c — c ■ Vi — Vi ■ c + Vi ■ Vi = r 2 , (15.50b) 
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for all 0 < i < n. If we subtract the equation i = 0 from all the equations with i > 0, then we find 


c ■ c — c ■ Vi — Vi ■ c + Vi ■ Vi — (c ■ c — c ■ vq — vq ■ c + vq ■ t>o) d = r 2 — r 2 , (15.51a) 

-c • Vi - Vi ■ c + Vi ■ Vi + c • v 0 + V 0 ■ C - v 0 ■ V 0 = 0, (15.51b) 

—2Vi ■ c + 2i>o • c — 2vo ■ Vo = —Vi ■ Vi — Vo ■ Vo . (15.51c) 

We now add +2 Vi ■ Vo to both sides of the equation to give 

-2Vi ■ c + 2v 0 ■ c - 2v 0 ■ v 0 + 2v.i ■ v 0 = —Vi ■ Vi - v 0 ■ v 0 + 2 Vi ■ v 0 , (15.52a) 

U 

(Vi - v 0 ) ■ (c - v 0 ) = \ (Vi - v 0 ) ■ (Vi - Vo) , (15.52b) 

(Vi - V 0 ) ■ (c-v o) = 5 II Vi - vo\\l ■ (15.52c) 

Using the matrix and vector notation 



Vi - Vo 


' k\\vi-v 0 \\ 2 - 


V 2 - Vo 


\ \\v 2 - V 0 \\l 

M = 


,}. b = 



_ Vi-V 0 _ 


. 5 II Vi - Volla . 


we can write the derivations for all the equations 0 < i < n as 

M(c — vo) = b, (15.54) 

which allows us to write the solution for the center of the enclosing sphere as 

c = v o + (15.55) 

and the radius as 

r = ||c — vo|| 2 = ||M _1 6|| . (15.56) 

In the case where we have three points, which lie in the same plane, then the sphere center will also lie 
in that plane and M e M 2x2 . For four points, M 6 R 3x3 . M will always be invertible [Eberly, 2005a] 
regardless of the number of points. 

A randomized algorithm can now be used to compute the minimum enclosing sphere of a point set 
with n > 1 points using the above results. The idea is a simple trial and error strategy, where a minimum 
sphere is first constructed from a subset of the point set, and then it is verified whether all points lie inside 
the sphere. If so, the algorithm terminates, and if not, a new point has been found lying outside the sphere. 
The new point is then used as a surface point in the computation of a new minimum enclosing sphere. 
The algorithm continues in this manner until all points lie inside the sphere. The algorithm is called the 
Welzl algorithm [Welzl, 1991, Berg et al., 1997], and Figure 15.6.3 shows a pseudocode version of the 
algorithm. For more details, we refer the reader to [Welzl, 1991, Berg et al., 1997]. 
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function mini-ball (P ) 

return compute-mini-ball (n(P), 0 ) 

function compute-mini-Ball( P, R ) 
if P = 0 or |7?| = 4 

D =circumscribed-sphere (R) 
else 

p = tail(P) 

D = compute-mini-Ball ( P — {p}, R ) 
if p ^ D 

D = compute-mini-Ball ( P — {p}, R U {p} ) 
movepto head of P 
return D 


Figure 15.44: Welzl algorithm for computing minimum enclosing sphere. 


15.7 Handling Deformable Objects 

Deformable objects are challenging for BVHs. Typically, a BVH will be calculated for an object’s initial 
rest shape, but during a simulation the deformation of the object will cause the BVH to come out of sync 
with the object shape. Even worse, the deformations can be so extreme that self-intersections can also 
occur. 

15.7.1 Updating the BVH 

The simplest solution to getting the BVH to cover the geometry after a deformation is to rebuild it from 
scratch; however, this is not computationally very attractive, and prohibits real-time or even interactive 
use of the BVH. 

A faster approach is to refit the current misaligned BVH. Reports have shown that it is about 10 
times faster to refit an AABB tree of a triangle mesh than to rebuild the AABB tree [Bergen, 1997], 
In general, if we have a fast method like 0(1) to refit a parent BV to cover its children BVs, then we 
could simply traverse the BVH in a bottom-up fashion. For example, for a deformable object, a leaf 
usually covers a single polygon, indicating that it is fast to refit the leaf BVs, and if refitting of parents 
based on their children BVs is fast, such as time 0(1), then the algorithm runs in 0(n) complexity. 
This is a so-called bottom-up update used by many [Bergen, 1997, Volino et al., 1998, Volino et al., 2000, 
Bridson et al., 2002, Mezger et al., 2003]. It is a fast and simple scheme, but has several drawbacks. 

First of all, the bottom-up update requires one to traverse the entire BVH, which may not be a serious 
drawback for self-intersections, since in this case it is likely that all BVs will be visited in the collision 
query anyway. 

A second problem with the bottom-up update is that not all BV types can be updated in 0(1) time. 
Some BV types, like AABBs and fc-DOPs, share the nice property that if they tightly cover their children 
BV, then they also tightly cover the geometry of their children [Larsson et al., 2003]. Therefore, it is 
sufficient only to consider the children BV geometries when refitting a BV of these types, yielding a 
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Figure 15.45: Illustration of refitting versus rebuilding. The two children AABBs of the root AABB are 
shown in the figure. Observe the large sibling overlap that occurs with refitting. 


complexity of 0(d), where d is the branching factor, usually a small constant such that Did) ~ 0(1). 
Other BV types, such as OBBs or spheres, require us to examine all the geometry covered by the subtree 
of a BV in order to refit with a tight BV. If the subtree covers k primitives, then a good OBB fit can 
be obtained in 0(k\ogk), implying a worse-case ()(n 2 logn) complexity for the entire update. A good 
sphere fit can usually be obtained in expected O(k) complexity yielding a total of 0(n 2 ). 

A third problem with the bottom-up update is that even in case of tight refitted BVs, pruning capa¬ 
bilities can deteriorate, due to an increasing overlap between sibling BVs. Deformations that keep the 
adjacency relation intact have been reported to have no significant performance deterioration for intersec¬ 
tion [Bergen, 1997]. 

Figure 15.45 illustrates the problem of large overlap between sibling BVs, and shows how this problem 
could have been avoided by rebuilding the BVH. The first drawback of having to traverse the entire BVH 
can be avoided by a top-down approach, such that only those BVs used in a collision query arc refitted. 

If two objects arc distant, then only their root BVs will need to be updated, and if two deformable 
objects overlap only in a few places, then the tandem traversal will only require refitting of a few BVs. 
However, the top-down approach will not yield good performance when we consider self-intersections or 
close parallel proximities, such as cloth or skin on a human being, where every primitive is in contact with 
at least one primitive from the other object. In these cases, a collision query will traverse almost the entire 
BVHs of both objects and require most of the tree to be refitted. 

One advantage of the top-down method is that it is easily added to the collision query. During a world 
space or model space update, one simply also refits the two BVs. The top-down method also suffers from 
the two other problems that the bottom-up method had: for certain BV types, tight refitting is slow, and 
pruning capabilities can deteriorate, even with tight refitting sibling. 

Finally, comparing top-down with bottom-up methods, we can see that bottom-up methods rely on 
the fact that the children have been refitted prior to refitting a BV, which is not required for the top-down 
method. Thus, unless we want to evaluate the children, their children, and so on before refitting a BV, we 
need to come up with a clever way of avoiding the traversal of the entire BVH. In [Larsson et al., 2001] 
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root 


level (m/2) 

level (m/2)+1 
leaves 


Figure 15.46: Illustration of hybrid update, first levels of BVH are refitted bottom-up, the levels below 
m/2 arc refitted in a top-down fashion. 


the problem is solved by letting each BV in the hierarchy store the primitives covered by its subtree. Thus, 
in a top-down update method, one can simply refit a BV to cover the primitives stored in the BV. However, 
as pointed out by the authors, this results in a memory drawback. 

A hybrid update is suggested in [Larsson et al., 2001] where a bottom-up update is mixed with a top- 
down update, in the hope that a combination will give the advantages of both methods. The main idea is 
to try to minimize the number of BV updates. For a tree with depth m, the hybrid update method initially 
updates the m/2 first levels bottom up. During a collision query, when nonupdated BVs arc reached, 
they can either be updated top-down as needed, or a specified number of levels in their subtrees can be 
updated using a bottom-up method. Figure 15.46 illustrates the basic idea of the hybrid update method. 
According to [Larsson et al., 2001] their method is roughly five times faster than [Bergen, 1997], One 
will also observe that their numerical experiments hint that the complexity of the hybrid update method 
is similar to the bottom-up method, although the constants seem to be half as big for the hybrid update as 
for the bottom-up update (see Figure 3 in [Larsson et al., 2001]). 

15.7.2 Handling Self-Intersections 

By the nature of physical objects, their surfaces arc fully connected and closed, implying that all primitives 
arc in touching contact with their immediate neighbors. Thus, even without self-intersections, we will have 
0{n) contacts in a deformable object consisting of n primitives. Equation (15.10) therefore implies that 
without self-intersections, we should not expect to perform a collision query better than 

2nlog 2 (n) + 2n — 1 « 0(nlog 2 n). (15.57) 

For a deformable object, this is a lower bound, because we cannot exploit the idea of pruning as we did 
for static objects. Since every paid of an object obviously collides with itself, taking a BVH and testing it 
against itself will cause us to traverse all the way down all the paths to the leaves. 

Using the tandem traversal blindly will produce a lot of redundant work as we will show later. The 
literature has proposed two solutions to improve upon the complexity and redundancies of the traversal 
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Figure 15.47: Simple 2D example BVH for self-intersection query. The figure shows the original, finest, 
mid, and coarsest levels. 



Figure 15.48: Topology of BVH from Figure 15.47. 


problem: extending the overlap test in the tandem traversal with several other kinds of tests and using a 
single traversal instead of a tandem traversal. 

The traditional tandem traversal can be extended to include self-intersection by making sure that prim¬ 
itive pair testing is only performed if the primitives do not share a common point such as a vertex of a 
face. This will disregard immediate neighbors and the primitive itself from primitive pair testing. 

We’ll study the problems of using BVHs blindly for self-intersection testing by a simple example. 
In Figure 15.47 we show a small stick figure consisting of five sticks forming a diamond together with 
the BVs that form a BVH. The topology of the BVH is shown in Figure 15.48. Observe that we have 
chosen AABBs and that the BVH was built using mesh connectivity. A bounding volume test tree ( BVTT ) 
is shown in Figure 15.49, which is the result of performing a tandem traversal of the BVH shown in 
Figure 15.48 against itself. From the figure it is evident that the pruning capabilities have degraded com¬ 
pletely for a self-intersection test. Only in the C-G overlap were the traversals able to prune, as shown 
grayed in the figure. From the BVTT we see that a total of 40 BV overlap tests arc performed, from which 
8 primitive pair tests arc performed. 

In comparison with a brute all-sticks against all-sticks, which yields 10 tests, disregarding redundant 
tests will result in only 5 primitive pair tests. Obviously, the brute-force approach out performs the BVH 
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Figure 15.49: Bounding Volume Test Tree (BVTT), each node represents a BV test, children represent a 
descend in the tandem traversal, and primitive pair testing is shown as annotated triangles. 


in this case. 

In a real application, objects would be much bigger, and the extent of a single primitive would be far 
less than the extent of the entire model, implying that for a bigger BVH the traversal will be able to better 
prune those primitives lying distant from each other, such as the C-G overlap test in the example above. In 
conclusion, for large n, the BVH will beat the all-pair 0(n 2 ) complexity with its 0(n log 2 n) complexity. 

Some properties should be noted from the example: out of the 40 BV overlap test, 16 of them had 
redundancies, implying that 8 of these where done twice. In four cases, these redundant BV overlap tests 
led to primitive pair tests, implying that two primitive tests were done twice. We also see that if one can 
catch redundant tests lying close to the root, substantial improvements can be made. For instance, in the 
case of the redundant C-B and B-C test, only one of their subtrees needs to be explored, thus pruning the 
B-C tree would reduce the work with 9 BV overlap tests and 4 primitive tests. 

Since we already know that every leaf of the BVH will collide with the BVH, there is really no need 
to perform a tandem traversal. Instead, we can perform a single traversal. The idea is to iterate over 
all leaf BVs, testing each of them recursively against the entire BVH. The pseudocode for the single 
traversal is presented in Figure 15.50. For completeness, we have shown the BVTT in Figure 15.51 using 
a single traversal for self-intersection testing of the BVH shown in Figure 15.48. In comparison with 
Figure 15.49, notice that we perform just slightly better. This time there is a total of 38 BV overlap tests. 
We still perform the same amount of primitive pair tests, and there arc still 16 redundant BV overlap tests; 
however, there is one difference. The redundancies arc all leaf BV overlap tests—no internal redundant 
BV overlap tests exists. However, we can still not beat the brute-force method for the example. Another 
important observation is that the BVTTs for the single traversal have less height than the BVTT for the 
tandem traversal. 

A modified tandem traversal is presented in [Volino et al., 2000] for self-intersection testing. Fig¬ 
ure 15.52 shows the modified self-intersection traversal in pseudocode. The modified traversal exploits 
the fact that there is no need to test a BV against itself. It completely removes all redundant overlap tests, 
since the recursive sibling testing is invoked such that a tandem traversal is run only once between two 
children. To see the advantage of this traversal, a BVTT is shown in Figure 15.53. As seen in the figure, 
the number of overlap tests has dropped to 12, and the number of primitive tests is 3. The self-test algo¬ 
rithm is invoked 8 times. There is no redundant overlap tests or primitive tests. This modified traversal 
beats the brute-force approach, and it avoids the redundant leaf BV testing, which the single traversal 
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algorithm self-intersect (A : BVH ) 
for all leaf BVs B in A do 
single(root,B) 
next B 

end algorithm 

algorithm single ( A : BV, B : BV ) 
Queue Q 
push ( Q, A) 

while not Q empty do 
pop (Q,A) 

if not colliding (A, B) then 
continue 
if leaf (A) then 

contact-determination( A,B) 
else 

for all children C of A do 
push (Q,C) 
next C 
end if 
end while 
end algorithm 


Figure 15.50: Single traversal for self-intersection testing. 


suffers from. 

Mezger el al. [Mezger et al., 2003] have extended the work in [Volino et al., 1998] by using a top- 
down splitting approach for building fc-DOP trees, and a BV inflation technique, where applied, to ac¬ 
commodate proximity queries of a moving and deforming object. Furthermore, they have extended the 
curvature testing used by [Volino et al., 1998, Volino et al., 2000], with several other heuristics. We refer 
the reader to [Mezger et al., 2003] for more details. Here we will limit ourselves to present the basic idea 
behind the curvature testing extension. 

As we have seen, the adjacent structural nature of primitives causes a performance degradation on the 
BVH traversals, leading to deep BVTTs traversing all the way down to leaf BV tests. Both the tandem, 
single, and modified traversals we have presented seem to suffer from these artifacts. However, these 
problems will be remedied by extending the traversals with a curvature test. 



Figure 15.51: Bounding Volume Test Tree (BVTT) for single traversal. Compare with Figure 15.49. 
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algorithm self-test (A : BV ) 
if leaf A then 
return 
end if 

for all children C of A do 
self-test(C) 

for all children D right of C do 
recursive (C,D) 
next D 
next C 

end algorithm 


Figure 15.52: Modified traversal for self-intersection testing. Siblings arc tested against each other using 
the traditional recursive tandem traversal (see Figure 15.15); a child is tested against itself by a recursive 
call on all of its children. 



Figure 15.53: BVTT of modified traversal from Figure 15.52 on BVH from Figure 15.48. Square nodes 
denote calls to self-test, circles arc calls to recursive tandem traversal. 


In Figure 15.54, we show a simple example: A stick is bent from being completely flat to self- 
intersecting. We would like to formalize this measure of bending, such that we could use it in a BVH 
traversal. If we, for instance, knew a surface was flat, then there would be no need to perform any self¬ 
intersection testing on the surface. Looking at the normals of the stick, these tell us something about how 
much the stick is being bent. In the 2D case, it is obvious that in order for the stick to bend over and 
self-intersect, there must be two normals at some point on the surface where the angle between them is 
greater than ir radians. This angle is directly related to the curvature. 

The requirement for the difference in normals to be more than ir radians is a necessity, but not a 
sufficient condition for a self-intersection, as can be seen in Figure 15.55. Clearly the curvature suggests 
that there might be a self-intersection where there is none. Thus, we can only use a curvature test to prune 
away flat cases, not curled up cases. Moving into 3D from 2D complicates matters a little bit, since we 
now have a 2D surface bending in 3D space. 
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Figure 15.54: A stick bending in 2D with self-intersection. 



Figure 15.55: A stick where curvature indicates a self-intersection, but there is none. 


The problem is that even though the curvature indicates a flat surface, the surface might still be self- 
intersecting, such as the example of an elastic rubber sheet lying on a table, where two corners meet 
without bending the sheet too much out of the plane. Figure 15.56 illustrates the problem. A sufficient 
criteria is to require that the projected surface boundary onto a plane nearly coplanar with the flat surface 
does not self-intersect. These observations lead to the following definition: 

Definition 15.3 (Curvature Criteria) 

A continuous surface, S, with boundary, C, does not have any self-intersections when the following crite¬ 
ria are met: 

• there exists a vector v for which the normal n(p) at every point p G S fulfill 

v ■ n > 0 (15.58) 

• the projection of C onto the plane orthogonal to v has no self-intersections 

The definition is illustrated in Figure 15.57. When we apply the curvature test, we will need to update 



“book” — 2005/9/30 — 15:44 — page 500 — #512 


500 


CHAPTER 15. BOUNDING VOLUME HIERARCHIES 



Figure 15.56: A 2D self-intersecting surface in 3D space. 



Figure 15.57: A continuous surface S, with boundary C, fulfilling the curvature criteria (see Defini¬ 
tion 15.3). 
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Figure 15.58: An example surface region, consisting of four faces, a, b, c, and d with their respective 
normals. 



Figure 15.59: Curvature information stored for leaf BVs. 


the BVH with curvature information. Here we implicitly assume that a mesh-connectivity was used to 
build the BVH, implying that a BV node in the BVH covers a surface region. Its parent BV will cover 
a larger surface region, being the union of the BV node and its siblings. Furthermore, the parent surface 
region is a connected region implying that for two arbitrary points a continuous and unbroken curve can 
be drawn on the surface connecting the two points. 

Instead of trying to compute how big the normal angles arc for each surface region, it is more efficient 
to store the half space, from which a vector v can be found. Figure 15.58 shows a simple model we will 
use to illustrate the idea. Here a simple surface region is shown, consisting of four faces: a, b, c, and d. 
The corresponding BVH consists of 4 + 3 = 7 BVs: four leafs, two intermediate BVs each covering half 
of the surface, and one root BV covering the entire surface. 

For each leaf BV we will store the half space, from which a valid o-vcctor can be found for the 
corresponding surface as illustrated in Figure 15.59. Here circle cones arc used to represent the valid 
directions for u-vectors. When the BVH is updated in a bottom-up fashion, the curvature information 
can easily be propagated from the leaf BVs to the root BV. The half-space cone for a valid ?>vector for 
a parent BV is found as the intersection of the half-space cones for its children. This is illustrated in 
Figure 15.60. In the figure, the big arrows point in the direction of a parent BV. Observe that one can 
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Figure 15.60: Curvature information propagated from leaf BVs to root BV. 


easily find a suitable ^-vector from the half-space cone of the root, which will work as a n-vcctor for the 
entire surface. 

In practice, one does not store an actual half-space cone. Instead, the space of possible ^-vectors 
is discretized onto a fixed set of directions. These directions could, for instance, be picked from an axis- 
aligned unit cube as follows: each vertex, edge middle, and face middle yields a vector direction, providing 
us with a total of 26 directions. Other geometries arc discussed in [Volino et ah, 2000]. 

The benefit of the discretization is that half spaces can be efficiently implemented as bit-patterns. For 
instance, the 26 direction cubes can be encoded by attributing each BV in the BVH with a 32-bit unsigned 
integer, where each of the 26 first bits indicates whether the normal direction would be a valid n-vcctor for 
the surface region covered by the BV. During propagation of curvature information from leaf BVs to root 
BV, the curvature information of a parent is easily found by taking the bit-wise AND of all the bit-patterns 
of the children BVs. 

The self-intersection testing from Figure 15.52 is easily extended with the curvature testing. If the 
curvature criteria passes for a surface covered by a BV, then the recursion stops. This is shown in Fig- 
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algorithm self-test (A : BV ) 
v = get u-vector of A 
if valid v then 

if not projectedBoundarylntersect (A,v) then 
return 
end if 
end if 

if leaf A then 
return 
end if 

for all children C of A do 
self-test (C) 

for all children D right of C do 
recursive (C, D, true) 
next D 
next C 

end algorithm 


Figure 15.61: Modified traversal with curvature testing. 


ure 15.61. Observe that the recursive tandem traversal has been extended with an extra argument, which 
we will explain in detail shortly. When testing siblings against each other, we would also like to exploit 
the benefits of the curvature test. Obviously, the BVs of the siblings will overlap if the surfaces covered 
by the sibling BVs arc adjacent, that is, they share a common border. So we cannot hope to rely on the 
BV overlap test to prune the nearly flat adjacent surface regions covered by two sibling BVs. The BV 
overlap testing, however, will still work fine for pruning nonadjacent surfaces. Figure 15.62 illustrates the 
usefulness of the BV overlap test compared to the curvature test. As seen from Figure 15.62, we would 
like to extend the recursive tandem traversal with a curvature test when we arc dealing with two BVs that 
represent adjacent surface regions. The extended recursive tandem traversal is shown in Figure 15.63. The 
pseudocode in Figure 15.63 seems straightforward, but we have omitted one detail for clarity: how does 
one determine whether two surface regions arc adjacent? As suggested by [Volino et ah, 2000], each BV 
in the BVH is annotated with a list of surface vertices lying on the boundary of the surface region covered 
by the BV. The list only contains vertices indicating adjacency with another surface. In other words, no 
two vertices arc added to the vertex list of the BV representing the same adjacency information. The vertex 
list can be created initially, when the BVH is being built. Thus, there is no need to update any adjacency 
information during simulation. An adjacency test can now be performed by comparing the vertex lists of 
two regions: if they share a vertex, then the regions arc deemed to be adjacent. Theoretically speaking, 
cases could be conceived where the vertex list could be rather long, implying an expensive adjacency test. 
However, [Volino et al., 2000] report that for the usual cases, the lists hardly exceed six vertices, meaning 
that the adjacency can be considered to have constant complexity. Figure 15.64 shows an example of two 
vertex lists for two surface regions. According to [Volino et al., 2000], the boundary intersection testing 
is never decisive in the final collision detection process, and is therefore not performed. We would like to 
note that the boundary testing only makes sense for a surface with an open boundary, implying that it is 
superfluous for closed surfaces. [Volino et al., 2000] reports that the curvature testing yields an algorithm 
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Non Adjacent Surfaces rely on BV overlap 
overlap ~ detection No overlap ~ No detection 



Adjacent Surfaces rely on curvature criteria 
Curvature test failed ~ detection Curvature test succes ~ No detection 



Figure 15.62: Illustration showing that the BV overlap test is well suited for nonadjacent surfaces, but for 
adjacent surfaces the curvature test is better. 


that is linear in the number of colliding faces, which is far better than the 0(n log n) lower bound we have 
for the simple-minded use of the traversals. 


15.8 Contact Determination with BVHs 

When using BVHs for physics-based simulation, we are interested in obtaining a set of contact points as 
explained in Chapter 14. Typically a contact determination algorithm is used for processing the results 
from a narrow-phase collision detection algorithm. This approach can also be taken when using BVHs by 
enqueuing primitive pairs as the collision query algorithm encounters them, for example, in line eight of 
Figure 15.16. However, a separate contact determination phase is usually not needed when using BVHs, 
but can instead be intervened with the collision query. That is, during the collision query a contact deter¬ 
mination routine is invoked, which processes two primitive pairs, such as two triangles, and produces a 
set of contact points, which is enqueued as the collision result of the BVH collision query. This section 
discusses this approach to contact determination, while doing the collision detection queries of BVHs. 
As will be explained, the main problem with this approach is that certain degenerate cases result in mul¬ 
tiple reports of identical contact points. A few examples of degenerate cases is shown in Figure 15.65 
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algorithm recursive (A : BV, B : BV , adjacent ) 
if not colliding (A, B) then 
return 
end if 

if adjacent then 

adjacent = AdjacentState(A,B) 
if adjacent then 

v = get u-vector of ( A, B) 
if valid v then 

if not projectedBoundarylntersect(A,B,v) then 
return 
end if 
end if 
end if 
end if 

if leaf (A) and leaf(B) then 

contact-determination (A,B) 
return 
end if 

if descend (A) then 

for all children C of A do 
recursive (C, B , ad j acent) 
next C 
end if 

if descend (B) then 

for all children C of B do 
recursive (A,C, adjacent) 
next C 
end if 

end algorithm 


Figure 15.63: Recursive tandem traversal extended with curvature testing. 


and Figure 15.70. self-intersection adds yet another complication to these multiplicities, as illustrated in 
Figure 15.74. 

Multiple reported identical contact points arc undesired in physical simulation due to several reasons: 
first, they provide a performance decrease in iterative methods for collision resolving. However, they sel¬ 
dom cause different physical behavior or problems with the computations involved in the iterative collision 
resolving. Second, imagine that collision resolving is done by back-projecting penetrating contacts, that 
is, the first time a contact is encountered it will be projected to resolve penetration. If the new projected 
state is not reflected in the remaining multiple reported contacts, then further excessive projections will 
be carried out, yielding a final unwanted position. Finally, in the case of simultaneous collision resolving, 
such as the constraint method in Chapter 7, multiple reported identical contact points lead to numerical 
instability in the system matrix, meaning that not only do they cause a performance decrease, but they 
could also result in unsolvability of the numerical methods. 

Generally we could take two avenues to handle the problem. First, we could introduce a postprocessing 


© 


© 



“book” — 2005/9/30 — 15:44 — page 506 — #518 


506 


CHAPTER 15. BOUNDING VOLUME HIERARCHIES 



Figure 15.64: Illustration showing vertex lists for two surface regions, A and B. Notice that only a very 
few number of vertices is needed to capture all adjacency information about the two surfaces. 


step of the BVH collision query, which filters out multiple reported contact points, and second, we could 
devise some sort of clever bookkeeping during the traversal query, which is able to prune away multiple 
contact points before they arc even considered. The first solution is really just a reintroduction of a contact 
determination phase to the collision detection engine, and therefore really not interesting. Besides, the 
filtering needed indicates, at worst, an 0(n 2 ) time-complexity, which is undesirable. Therefore, we will 
focus on the second solution here. 

When doing physics-based animation, there arc two kinds of collision detection queries that interest us: 
interference and proximity queries. An interference query will search for the actual intersection between 
two objects, while a proximity query will search for the nearest parts of two objects. The important thing 
to notice is that an interference query only works when penetrations have occurred, whereas a proximity 
query will return results even when objects arc separated by a small threshold. Interference queries could 
prove to be a problem in simulation since the simulation method tries to avoid penetrations. Contact 
tracking is a possible way to resolve this, however, proximity testing is far better but requires knowledge 
of the movement of the objects as will be discussed in Section 15.8.2. 
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Figure 15.65: Interference queries can lead to multiple reported (. E , F) contacts. 


15.8.1 Interference Query 

Contact points between polygonal objects arc often represented by a pair of features, such as ( V., V), 
(V, E), ( V ., F ), ( E , E ), (E, F ), and (F, F), where V is a vertex, E is an edge, and F is face. This was 
introduced in Chapter 14.1. For polygonal objects, (F. F) contact points arc sufficient for representing 
the vertices of intersection between two objects, which will be the focus for interference query in the 
following. Case A in Figure 15.65 illustrates this. Here, two objects, each consisting of two triangles, 
arc intersecting each other. Observe that the (E, F ) intersections result in four intersection points, which 
describe the vertices in the polyline representing the intersection between the two objects. 

Case B shows a more difficult case, where one of the intersection points is given by an edge-edge 
intersection. However, one could just as easily pick one of the faces of the square object together with 
the intersecting edge from the triangular object, yielding an (E. F) contact point representing the same 
intersection point as the (F, E) contact point. There is some ambiguity in picking a face from the square 
object, since both faces yield the same intersection point, but two different contact points. Recall that 
contact points are uniquely identified by the features that constitute them, not by the position of the point 
in 3D space. 

The case could become even more complicated. Imagine moving the square object such that the 
intersecting edge intersects one of the end points of the edge from the triangular object. Now the ambiguity 
in contact point representation is even higher; a single (V, E) type, or one of two (E. E) types or one of 
five ( E , F) types could be picked. However, the important thing to notice is we can pick an (F. F) contact 
type. 

Case C shows another challenging case. Here, one intersection point is given by an (E, E) contact 
point type and can be resolved in the same manner as we just discussed for case B. The main difficulty 
illustrated by this example is that the other intersection point, which is given by a ( V. F ) contact point 
type, consists of the vertex of the square object and the face of the triangular object. Again, we see that 
we could pick one of the incident edges of the vertex from the square object, which is not coplanar with 
the face of the triangular object. Again we end up with an (E. F) type of contact, and again we have the 
same ambiguity in picking an edge, since both the edges lying above and below the triangular face will 
result in a contact point representing the same intersection point. We did ignore the edge from the square 
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object that is cop I an ar with the face of the triangular object. Picking this edge would result in a very bad 
situation, since this edge is collinear with the other contact point. 

In conclusion, we see that in the case of intersection, it is always possible to pick an ( E, F) type 
contact point to represent any intersection point between two polygonal objects. We also conclude that 
there is an ambiguity in picking the (E, F ) contact. This indicates a nonuniqueness, when we want to 
identify contacts by the features constituting them. The nonuniqueness would be devastating for any kind 
of bookkeeping algorithm and is therefore unwanted. 

If we allowed reporting types of contact other than (E, F ), then the nonuniqueness could be removed 
by always picking the contact representing the lowest dimension of features: a vertex has zero dimensions, 
an edge has one dimension, and a face has two dimensions. 

Theorem 15.1 (Uniqueness of Contact Point Representation) 

Let F /i and F B be the features from objects A and B, and let dim(-) be a function returning the dimension 
of a feature, then F t i and F B constitute a unique contact point, if, and only if, 

dim (Fa) + dim(F B ) = inu\(dim(F' A ) + dim(F B )), (15.59) 

where F' A and F B could be any neighboring features of the features Fa and F B , including the features 
themselves in direct contact with the intersection point. 

It should now be easily seen that in case A the four (E , F) contacts have the lowest dimension, in case 
B it is the ( E , F) contact and the ( E. E) contact that have the lowest sum, and in case C it is the ( E, E) 
contact and the (V, F) contact. Observe that by this rule we can never have an (F, F) type contact, since 
it will always be possible to decompose such a contact into a set of contacts with lower dimension. 

Proof of Theorem 15.1: 


(V) V) case: The neighbors of a vertex are the incident edges and the incident faces. Therefore, picking 
the neighbors of either the vertex of object A or B will increase the dimension sum. 

(V, E) case: The only possible way of reducing the dimension sum is if the edge could be replaced by 
one of its neighboring endpoints. However, this would indicate that the end vertex is touching the 
intersection point, and a (V, V) type would be the correct answer. If the intersection point is not 
touching the endpoints, then the edge can only be updated to one of its neighboring faces, which 
will indicate an increase in the dimension sum. The vertex itself cannot be replaced by any of its 
neighbors since this would imply an increase in the dimension sum as well. 

(V, F ) case: The same arguments as previously apply to the vertex; the face has only lower dimensional 
neighbors, the edges and vertices on its perimeter. If the intersection point is not touching the 
perimeter, then we cannot reduce the dimension sum. 

(E, E) case: Obviously, none of the edges can be replaced by their neighboring faces, since this will 
result in a dimension increase. The only possibility for a reduction, is to replace one or both edges 
with one of their neighboring end vertices. This would mean that the intersection point is touching 
the end vertices and that the replacement would be valid and correct and result in either a ( V, E) or 
(V, V) type case, which we already have proved. 
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(. E , F) case: The only possibility for decreasing dimension of E is by picking one of its end-points, which 
is impossible, unless the intersection point touches one of them, in which case we arc in ( V. F ) case, 
which we have already proved. The only possibility left is replacement of F by one of its neighbors, 
but this is impossible, if they arc not touching the intersection point. □ 

Usually a good rule of thumb is to treat the case analysis by testing the most restrictive cases first. In our 
case that would mean: 

theorem 15.1 (Rule of most restrictive case first) 

Given two triangles, it is first verified whether there are any (V, V) contacts, then (V, E) types, followed 
by (V, F), {E, E), and ending with (E, F ) types. 

In this way, testing the most restrictive cases first will allow us to disregard the special cases of higher 
dimensional cases. As an example, imagine we have a (V, V) type of contact. Having reported this 
contact, we need not consider the cases where the edge on a ( V, E) contact reduces to one of its end-point 
vertices. 

The Proof 5 and Proposition 15.1 can straightforwardly be combined into an algorithm for contact 
determination based on a case analysis. A pseudocode version of this algorithm is shown in Figure 15.66. 

As seen from the pseudocode, there are roughly 102 lookups, which is quite a lot of tests for previously 
reported contacts. This is computationally infeasible unless a very cheap lookup method is available. If 
BVH data structures arc not shared among objects, meaning for instance, that two identical box objects 
would each have their own instance of a BVH, then a cheap lookup function can be constructed by ob¬ 
serving the fact that a single nonface feature can only be paid of exactly one reported contact point. All 
features could therefore be attributed by a pointer to the reported contact they arc paid of, and thus provide 
a cheap constant time test for previously reported contacts. Of course, all contact pointers of the features 
must be reset to null prior to a query. 

If one only cares about knowing whether a feature could be paid of a new contact point, then a more ef¬ 
ficient time-stamping scheme could be adopted: instead of a contact point pointer, each feature is attributed 
with a time-stamp. Whenever a feature pair is used to report a contact point, then their time-stamps arc 
updated with the current time. Whenever one encounters a feature pair and wants to know if these could 
constitute a new contact point that should be reported, one simply tests that both their time-stamps arc less 
than the current time. If this is true, then a new contact point can be reported. 

The proposed idea of using a pointer or time-stamp requires a huge memory usage, since objects 
cannot share the same BVH instance. Observe however, that when working with deformable objects, 
separate instances arc needed, since two geometries can deform in many different ways. The consequence 
is that the updated BVHs do not look the same even though the initial geometry of two deformable objects 
were identical. This implies that the outlined strategy for a fast lookup method is feasible for interference 
detection of deformable objects. 

If BVHs arc shared among objects, which is often the case for rigid objects, another lookup method 
must be devised. A simple solution would be to give every feature a unique index, and then use the indices 
to look up contact point information in a map or similar data structure. 

Consider case A once again. Assume we arc using the contact determination outlined above. Then, 
as can be seen in Figure 15.65, the upper object has two faces; we’ll label them Fa and Fb- Similarly, 
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void interference-detection-by-case-analysis( Ta,Tb ) 

for all vertex pairs ( Va,Vb ) where Va in Ta and Vb in Tb 
if (Va,Vb) not reported before then 
if dist(VA, Vb)<£ then 
report (Va, Vb) 
end if 
end if 
next (' Va,Vb ) 

for all vertices Va in Ta and all edges Eb in Tb 

if ( Va,Eb ) not reported before (Va,Ve) where Ve in Eb then 
if dist(Vyi, Eb)<£ then 
report (Va,Eb) 
end if 
end if 
next ( Va,E b ) 

repeat above loop with A and B reversed 
for all vertices Va in Ta 

if dist(VA, Tb)<£ and Va inside perimeter of Tb then 
report ( Va,T b ) 
end if 
next Va 

repeat above loop with A and B reversed 

for all vertex pairs (Ea,Eb) where Ea in Ta and Eb in Tb 
if ( Ea,Eb ) not reported before then 

if no mixed endpoints vertex pairs of Ea or Eb reported then 
if dist(£U, Eb)<s then 
report (Ea, Eb) 
end if 
end if 
end if 

next (Ea,Eb) 

for all edges Ea in Ta 

if no endpoints of Ea have been reported then 
if Ea intersect Tb inside perimeter then 
report (Ea,Tb) 
end if 
end if 
next Ea 

repeat above loop with A and B reversed 
end algorithm 


Figure 15.66: Contact determination by case analysis for interference queries. 
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the lower object has two faces; we’ll label these Fq and Fp. Fa and Fb share a common edge, labeled 
Supper- Similarly, the faces from the lower object share an edge, which we’ll label £j OW er- As seen from 
the figure, both faces of the upper object intersect with both faces of the lower object. This means that 
during a BVH traversal the following pairs of primitive test will be generated. 


Leaf tests 

(. fA,F C ) 

Ua,f d ) 

Ub,F c ) 

Ub,F d ) 


Contact Points 

(-Eupper - Fc), (El ower 
(Supper - F D ), (Ei ower 
(S U pper - S C ), (E\ ower 
(S U pper ~ F D ),(E\ ower 


Fa) 

Fa) 

F b ) 

F b ) 


The rightmost column shows the possible contact points that can be generated from each pairwise primitive 
test. The table shows that four unique contact points will exist, as we expected. However, the table also 
shows that each contact point will be reported twice if we do not guard against multiple reported contact 
points. This artifact is unavoidable and comes from the fact that neighboring triangles share features. 
Therefore, it is a problem one always has to consider when using BVHs. The contact determination 
algorithm we have outlined, will prevent this artifact. In our discussion so far we have implicitly assumed 
that we have complete topology information about the underlying geometry. If this is not the case, such as 
for a triangle soup, we arc in real trouble, because the same edge and vertices might be redundantly stored 
in a triangle soup. Therefore, it is nontrivial to uniquely identify features. 

We will now outline another more practical approach to the problem of interference intersection. The 
main idea is to consider only (E, F) contact point types, since these arc sufficient for representing any 
intersection. As previously explained, there is sometimes an ambiguity in picking (E, F) contacts, for 
example, case B and C in Figure 15.65. The ambiguity implies that two different ( E,F ) contacts might 
be reported, which represent the same intersection point. In the following, we will ignore this ambiguity 
problem. This is not as bad as it might seem, since in practice these degenerate cases rarely exist, and even 
if they do, they arc short-termed. The reason for this is that cases like B and C in Figure 15.65 require 
perfectly aligned and positioned objects. In a general simulation, numerical inaccuracies and round off 
errors will cause small perturbations, even in the case of perfectly aligned objects. However, theoretically 
speaking, there is a danger for some of the problems we discussed earlier regarding badly conditioned 
numerics for simultaneous collision resolving. If an iterative method is chosen instead, the proposed 
intersection strategy that we will outline will not prove to be a problem. 

Let’s label the vertices of a triangle in counterclockwise order by vq, v\, and vi. Then the edges of a 
triangle arc given by the pairs of vertices (no — v\), (v\ — V 2 ), and (v 2 — no). Given two triangles T \ and 
Tb, there is a potential of six (E. F) contact points. The interference algorithm starts out by an exhaustive 
test of all six potential contact points, as shown in Figure 15.67. Now let’s consider the steps involved 
in taking care of a single edge-face test. First we need to ensure that we do not have multiple reported 
identical edge-face contacts. This can be done by the strategics we elaborated on previously. 

If we pass the test for multiple reported contact points, then we need to determine whether there really 
is a possibility of an edge-face contact. This can be done by testing the signed distance of the edge vertices 
w.r.t. the face plane. However, we need to consider the C case shown in Figure 15.65. What should we do 
if the signed distance is zero? In the case of the horizontal edge characterized by both vertices having zero 
distance, we simply ignore it. Since we arc looking for interference, it seems appropriate to favor edges 
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algorithm practical-interference( T a , Tb) 
do-Edge-Face (Ta-Vo,Ta-Vi,Tb ) 
do-Edge-Face (Ta-V\,Ta-V2,Tb ) 
do-Edge-Face (Ta-V2,Ta-Vo,Tb ) 
do-Edge-Face (Tb-Vo,Tb-Vi, Ta ) 
do-Edge-Face (Tb-Vi,Tb-V2, Ta ) 
do-Edge-Face (Tb-V2,Tb-Vq, Ta ) 
end algorithm 


Figure 15.67: Edge-face interference tests of two triangles. 


lying mostly on the backside of the triangle. Therefore, we will ignore edges where all vertices have a 
nonnegative signed distance w.r.t. to the face plane. Looking at case C in Figure 15.65, this means that we 
prune the coplanar edge and the edge on the front side of the face of the triangular object. 

Having determined that the edge actually crosses the plane of the triangle, we can now compute the 
intersection point between the edge and the plane of the triangle. This is done by parameterization of the 
edge, and if the edge is given by the vertices vq and v\, then any point on the edge can be given by the 
parameterization 

u(t) = (i>i — «o) t + Vo, where t G [0..1] . (15.60) 

Due to the lineality of the edge, the signed distance of any point is given by 

dist(plane(T), it(f)) = (dist(plane(T),i?i) — dist(plane(T), wq)) t + dist(plane(T), wq). (15.61) 


The intersection is given by the fact that the signed distance w.r.t. the face plane is zero. Exploiting this 
fact in the above equation allows us to solve for the parameterization parameter. This yields 


dist (plane (T), wq) 

dist(plane(T), wq) — dist (plane (T), wi) 


(15.62) 


Knowing the value of the /-parameter, the intersection point can be computed by using the parameteriza¬ 
tion. Having found the intersection point, we can now detect if the edge-face case we arc testing constitutes 
a new contact point by testing whether the intersection point lies inside the triangle boundary. Barycentric 
coordinates, to be explained below, arc very efficient for determining whether a point lies inside a triangle, 
and we will therefore use these in the last paid of our test. The barycentric coordinates wq, to i, and w 2 arc 
defined such that 

u(t) = wqVo + W\V\ + w 2 v 2 (15.63) 


is the point in the plane of the triangle closest to u(t). If 0 < w 0, w\,w 2 < 1, then the point lies inside or 
on the perimeter of the triangle. Computing the barycentric coordinates, w 0, w\, and W2 of u(t ) implies 
solving the linear system 


(w 0 - V 2 ) ■ (v 0 - V2) ©0 - V 2 ) ■ (v 1 - v 2 ) 

w 0 


(Vo - v 2 ) ■ (■U - v 2 ) 

_(v 0 - V 2 ) ■ (V! - V 2 ) («1 - v 2 ) ■ (vi - V 2 )_ 

Wl 


_(vi - v 2 ) ■ (■u - v 2 )_ 


Wo + Wl + W 2 = 1 . 


(15.64a) 

(15.64b) 
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We introduce the shorthand notation 


Oil 

012 


W 0 


V 

Ol2 

«22_ 


Wl 


>2. 


Isolating w\ from the second equation yields 

b 2 - ai 2 w 0 

W\ = - 

022 

b 2 a\2 

=- Wo, 

a 22 a 22 

and substituting this into the first gives 

a n u ; 0 + ai 2 ((b 2 ~ a 12 wo)/a 2 2 ) = h, 

anu’o + ( 012 / 022)62 - a i2 ( 012/022 )wo = h, 

(an - 012 ( 012 / 0 , 22 ))wo = h- ( 012 / 022 ) 62 - 

Letting f = a 12 /a 22 gives 


(15.65) 

(15.66a) 

(15.66b) 

(15.67a) 

(15.67b) 

(15.67c) 


we have 


Oil — Oi 2 
m = , 

(15.68a) 

n = b\ — fb 2 , 

(15.68b) 

n 

w 0 = —, 

(15.69a) 

m 

02 , 

W 1 =- two, 

(15.69b) 

a 22 

W 2 = 1 — Wo — W\. 

(15.69c) 


Since we always have an > 0 and a 22 > 0, a solution will always exist regardless of the value of a 12 , 
61 , and b 2 . A pseudocode implementation of the computation of the barycentric coordinates is shown in 
Figure 15.68. We have now walked through all the details of the edge-face testing. A complete pseudocode 
version of the test is given in Figure 15.69. 


15.8.2 Proximity Query 

A proximity query is the process of finding nearest and possible touching parts of two objects. The concept 
of contact points is easily expanded to cover separated objects as the closest points between features of 
two polygonal objects. Thus, the term contact point is used even for objects in close proximity without 
contact. 
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algorithm barycentric-coords (to, ti, t 2 , u) 
ail = ( v 0 - t 2 ) ■ (to - t 2 ) 
a 12 = (to - t 2 ) • (ti - t 2 ) 
a 22 = (ti - t 2 ) • (vi - t 2 ) 
bi = (to — t 2 ) • (u — V 3 ) 

#2 = (ti — t 2 ) • ( u — V3) 
f = ai 2 /a 22 
m = (an - ai 2 /) 

n = 61 - (& 2 /) 
wo = n/m 
wi = b 2 /a 22 - fw 0 
w 2 = 1 — wo — Wl 
return (wo,wi,w 2 ) 
end algorithm 


Figure 15.68: Computation of bai'ycentric coordinates. 


algorithm do-Edge-Face (to, ti, T) 

if existEdgeFace (to, ti, T) then 
return 
end if 

do = dist(plane(T),vo) 
d\ = dist(plane(T),vi) 
if do > 0 and d\ > 0 then 
return 
end if 

if do < 0 and di < 0 then 
return 
end if 

u = inter section (to, ti, T) 

(wo, wi, w 2 ) =barycentric-coords (to, ti, t 2 , u) 
if inside (wo, wi, w 2 ) then 
report (to,ti ,T) 
end if 

end algorithm 


Figure 15.69: Edge-face interference test 
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When doing proximity queries, we could apply the same strategy for reporting unique contact points 
as presented in the previous section. However, a simpler strategy is available based on the following: in 
the case of a proximity query, ( V. F ) and (E, E) contact points arc sufficient for representing any contact 
region between two polygonal objects. Since there arc no penetrations, there is no need for (E. F) contact 
points. 

As in the case of interference, we encounter the problems of ambiguity and multiple reported identical 
contact points. The latter being a consequence of neighboring triangles sharing common boundaries, 
as explained in the previous section. Here, we’ll briefly study the problem of ambiguity and convince 
ourselves that ( V , F) and (E, E) contact types arc sufficient. 

Looking at case A in Figure 15.70, we see that the minimum dimension contact is a (V, V) type, which 
is a unique contact for this sort of touching contact. However, we could just as well have picked any pair 
of edges, one from the upper object and one from the lower object, or we could have picked the vertex 
from one of the objects and an arbitrary face from the other object. It is obvious that we can suffice with 
(V, F ) and (E. E ) contact types for this kind of touching contact, and it is apparent that the ambiguity for 
this case is really bad. The number of possible contact points for representing the same touching point 
is quadratic in the number of features containing the touching point. However, in any simulation system, 
this kind of contact will occur rarely, and even if something similar is set up, numerical accuracies and 
round-offs will add perturbations. 

In case B in Figure 15.70, we have a trivial example of a (V, F) type of contact. In this case there 
is no ambiguity, however, imagine moving the cone such that the vertex lies on top of the border of the 
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triangle. The touching point could still be represented by the (V, F) contact, but again we have ambiguity 
in choosing a representation for the touching contact. 

The C case in Figure 15.70 illustrates the ideal (E, E ) type of contact, where there is no ambiguity in 
the representation. However, displacing one of the objects, such that the touching point on the common 
edge is moved to the boundary of the other object, results in ambiguity of the contact representation. 

Actually, case B and C show why it is (V, F ) and (E. E) contact types that arc the smallest set of 
contact types that suffice for representing a touching contact region between two polygonal objects, since 
in these two cases the contact types arc the minimum dimension representing the touching point. 

Usually collision queries arc invoked at discrete times. This leads to well-known artifacts, such as 
tunneling—objects that pass through each other (see Chapter 6.4). For proximity queries with BVHs, it 
can also lead to unexpected interference, such as penetration of objects. This is undesirable, since we arc 
seeking only touching or near-touching contacts. This calls for continuous collision detection. 


15.8.3 Continuous Collision Detection 

Unlike discrete collision detection, continuous collision detection deals with every collision event in a 
given time span. There arc two flavors of continuous collision detection. We can either look forward in 
time, from the current instance of invocation to the next expected instance, or we can look backward in 
time, from the current instance of invocation to the previous instance. 

Assuming that the time-step is small, it is justified to look upon the motion between the two instances 
of time as being linear. The error in this assumption is linear in the size of the time-step, h, that is, 
O(h). From a convergence theory point of view, we can simply let the time-step go to zero and the 
approximation would be exact. In practice, this is of course, impossible, but the error can be made so 
small that it is negligible compared to errors coming from numerical inaccuracies and round off. 

Having assumed linear motion, we can compute the space swept by a single triangle over the time- 
step. Say that the three vertices of a triangle move with velocities Uq, u \ , and u 2 , then we expect the 
triangle to be at the candidate positions given by 


v' Q = vo + huo, (15.70a) 

v\ = v\ + hui, (15.70b) 

v ' 2 = v 2 + hu 2 . (15.70c) 

If we look backward, the candidate positions would be 

v ' 0 = Vo — huo, (15.71a) 

v\ = V\ — hui, (15.71b) 

v 2 = v 2 — hu 2 . (15.71c) 


Or even better, if we cached vertex positions, then we would know the exact candidate position, without 
having to compute them as above. Instead, we could estimate the linear velocity they must have moved 
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with as 


u 0 


U 1 


U 2 


Vo - v ' 0 
h 

vi ~ v'i 

h 

v 2 - v ' 2 


h 


(15.72a) 

(15.72b) 

(15.72c) 


A simple approach to continuous collision detection is to update the BVH prior to a collision query, in the 
same way we would do for a deformable object. The only difference is in how the leaf-bounding volumes 
are updated. Traditionally, these would enclose the geometry at the current instant of time. However, for 
continuous collision detection, we would make it enclose the geometry both at the current position and at 
the candidate position. So in the case of AABBs this would imply 


Prain = min (v 0 ,v 1 ,v 2 ,v' 0 ,v' l ,v 2 ) , (15.73a) 

Pmax = max(vo,Vi,V 2 ,v > 0 ,v' 1 ,v , 2 ) . (15.73b) 


Observe that the update of the BVH is the same regardless of the flavor of the continuous collision detec¬ 
tion. The only difference lies in how the candidate positions arc picked. 

The approach we have described here is most convenient for deformable objects, where the BVHs 
arc not shared among objects, and BVHs arc updated anyway to adapt to the deformation of objects. For 
rigid objects, continuous collision detection can be made more efficient simply by updating the pairwise 
BVs during the traversal, but before they arc tested against each other for collision (see [Eberly, 2005b, 
Eberly, 2005c] for an example or [Redon et al., 2002, Redon, 2004b, Redon, 2004b, Redon et al., 2004b, 
Redon et al., 2004a] for more details on continuous collision detection). 

During a query traversal when two overlapping leaf bounding volumes arc encountered, this indicates 
a possible contact over the time-step we arc looking at. Therefore, the contact determination needs to find 
the point in time where a touching contact first appeal's. 

The contact determination works by examining all possible contact types, exactly as we did in the case 
of interference. However, this time we look for (V, F) and (E. E) contacts. The pseudocode is shown in 
Figure 15.71. In each of the contact cases, first we will try to find the time of collision and then perform 
the contact determination of the geometries at their positions at that time. 

Due to our assumption of linear motion over the time interval, h, the time of contact must be char¬ 
acterized by the vertex lying in the face plane in the case of a (V,F) contact or by the two edges being 
coplanar in the case of an (E. E) contact. Regardless of the case, we seek a point in time where four 
points all lie in the same plane. 

The idea is to use three points to compute a plane normal, and then use one of the three points to find 
a distance between the plane and the origin. Finally, the fourth point is used for a point in the plane test. 
For convenience, in the case of the (V, F), let us label the points of the triangle in counterclockwise order: 
Xi, x 2 , and X 3 and the point of the vertex x±. In the case of an ( E. E) type, we label the end-points of 
the first edge X\ and x 2 , and the end-points of the second edge *3 and X 4 . The corresponding velocities 
are in both cases labeled u\, u 2 , U 3 , and 114 . Now we’ll set up two vectors x 2 — x\ and x% — x\, and 
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algorithm practical-proximity( T a , T\,) 
do-Vertex-Face (Ta-Vo,Tb ) 
do-Vertex-Face (Ta-Vi,Tb ) 
do-Vertex-Face ( Ta-Vi,Tb ) 

do-Vertex-Face (Tb-Vo,Ta ) 
do-Vertex-Face ( Tb-Vi,Ta ) 
do-Vertex-Face (Tb-V2,Ta ) 

do-Edge-Edge (Ta.vo,Ta.vi,Tb.vo,Tb.vi ) 
do-Edge-Edge (Ta-Vi,Ta.V2,Tb-Vo,Tb.vi ) 
do-Edge-Edge (Ta-V2,Ta-Vo,Tb-Vo,Tb.vi ) 

do-Edge-Edge (Ta-Vo,Ta.vi,Tb-Vi,Tb.V2 ) 
do-Edge-Edge (Ta-Vi,Ta-V2,Tb-Vi,Tb-V2 ) 
do-Edge-Edge (Ta-V2,Ta.vo,Tb-Vi,Tb.V2 ) 

do-Edge-Edge (Ta-Vo,Ta.vi,Tb-V2,Tb.vo ) 
do-Edge-Edge (Ta-Vi,Ta.V2,Tb.V2,Tb-Vo ) 
do-Edge-Edge (Ta-V2,Ta-Vo,Tb.V2,Tb.vo ) 
end algorithm 


Figure 15.71: Practical proximity contact determination between two triangles. 


take the cross product of these two to obtain a plane normal 

n = ((x 2 - Xi) x (*3 - 2 : 1 ). (15.74) 

The point x\ must lie in the plane, so the distance, d, to the origin is given by 

d = n ■ x i. (15.75) 


In order for the last point to lie in the plane, then its distance to the plane must be zero, that is, 


n ■ x^ — d = 0. 


(15.76) 


Substitution leads to 


(( x 2 - Xi) x {x 3 - * 1 )) • (*4 - x-i) = 0. (15.77) 

Whenever this equation is fulfilled, the four points will lie in the same plane. The last thing we need to 
consider is the motion of the points. That is, we have 

((x 2 (t) - Xi (t)) x (x 3 (t) - ®1 (/))) • (x 4 (t) - ®1 (t)) = 0, (15.78) 


and we want to determine the smallest nonnegative value t < h, which makes the above equation true. 
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Now we find 


Xi(t) 

X2 (t) 
*3 (t) 
X 4 (f) 

By substitution we have 


Xi + Ult , 

(15.79a) 

x 2 + u 2 t, 

(15.79b) 

X 3 + U 3 t , 

(15.79c) 

X4 + u 4 t. 

(15.79d) 


(((*2 - X{) + (U 2 ~ Wl) t) X ((* 3 - £Ci) + {u 3 - Ml) /)) • ((*4 “ *l) + ©4 ~ «l) t) = 0. (15.80) 


This is a cubic polynomial, and an analytical solution exists. Its three roots can be found, and the ver- 
tice’s positions can thus be found at the specific times. A rounding error may hide a collision at the 
boundary between two time-steps; a test at the end of the time-step t = h should therefore by performed 
[Bridson et al., 2002]. 

Having found the point in time where the four points are coplanar, the (V, F) case is similar to the 
(E. F) case in the interference testing after having found the intersection point. The difference is that 
the vertex position at the time of contact replaces the intersection point. The pseudocode in Figure 15.72 
illustrates the basic idea. We will now focus on the (E, E ) case. As in the (V, F) case, we first find the 
roots, and then we examine the geometries in ascending order. 

For simplicity we let x\, X 2 , * 3 , and x 4 denote the edge geometry positions at the instance in time 
corresponding to a root. First we test whether the edges are parallel. This is the case, when 


(x 2 - *1) x (x 4 - x 3 ) = 0 . 


(15.81) 


In practice, an equality test will not work, so we use a threshold test instead. If the test succeeds, then a 
dimension reduction technique can be used by projecting the vertices of one edge onto the line running 
through the edge. However, one could also simply drop the case, since if a touching contact exists, it could 
just as well be represented by two (V, F) type contacts. Therefore, we will only consider the general case 
where the two edges are in general position, that is, there is exactly one touching point between them. 

Parameterization of the two edges with the a and b parameters, yields 

X\ + a (x 2 — * 1 ), (15.82a) 

*3 + b (x 4 — x 3 ). (15.82b) 


The touching point between the two lines must also be the closest point, and the closest point is character¬ 
ized by the minimum distance, so we seek the values of a and b, which minimizes 


V ((*1 + a(x2- *1)) - (® 3 + b (x 4 - X 3 ))) 2 . 

Taking the derivative w.r.t. a and b yields the so-called normal equations, 


(x 2 - * 1 ) • 

(x 2 - Xi) 

- (*2 - Xi) 

• (®4 ' 

- * 3 ) 

a 

- (x 2 - Xi) 

■ (x 4 - x 3 ) 

(*4 - * 3 ) ' 

(®4 - 

•* 3 ) . 

b 


(x 2 - * 1 ) • (x 3 - x 4 ) 
- (x 4 - x 3 ) • (x 3 - *1) 


(15.83) 


(15.84) 
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algorithm do-Vertex-Face(t, T) 
if existVertexFace (v,T) then 
return 
end if 

for each root t in ascending order do 
xi = T.v i + tT.u i 
X2 = T.V 2 + tT. M2 
*3 = T.V 3 + tT.U 3 
X4 = v + tv.u 

H = triangle{x\, X2, X3) do = dist(plane(H),X4 ) 
di = dist(plane(H ), X4) 
if do > 0 and d\ > 0 then 
return 
end if 

if do < 0 and d\ < 0 then 
return 
end if 

u = inter section (*4, H) 

(■ wo , w 1,102) =barycentric-coords (xi, X2, *3, X4 ) 
if inside (wo, un, 102) then 
report ( v,T) 
return 
end if 
next t 

end algorithm 


Figure 15.72: Vertex-face proximity test. 


Solving for a and 6, we can compute the closest points between the two lines running through the two 
edges. If a and b both arc not below zero or above one, then a contact point can be reported. The 
pseudocode is outlined in Figure 15.73. When looking backward in time we arc actually seeking the first 
point of contact. This point in space is attractive for simulation methods, since it implies the point in time 
a simulation should be rewound in order to avoid penetration. 


15.8.4 Self-Intersection 

Self-intersection queries add one more complication to the contact determination discussed in the previous 
sections. Observing that neighboring faces touch each other along common boundaries, trivially leads to 
a self-intersection along these common boundaries. In Figure 15.74, two triangle faces arc shown, and 
during a query of self-collision, the bounding volumes of these two faces trivially intersect each other 
leading to a primitive test between the two faces. However, we already know that the two faces would 
result in contact points being reported corresponding to the shared boundary between the faces. A simple 
prescreening can be applied to the contact determination avoiding the unpleasanties of this problem. The 
prescreening consists of a quick rejection, and we only run contact determination on triangular - faces, 
which do not share any nodes. Thus, no immediate neighboring triangles will yield any contact points. 
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algorithm do-Egde-Edge (x'i, x' 2 , x' 3 , x 4 ) 
if existEdgeEdge {x 4 , x' 2l x' 3 , x' 4 ) then 
return 
end if 

for each root t in ascending order do 
xi = x[ + tu\ 

X2 = x' 2 + tU 2 
X3 = x'z + tU3 
X4 = X 4 + tU4 

if (x2 — Xi) x (*4 — *3) > £ then 
return 
end if 

(a, ft) =closest-point (* 1 , *2, *3, *4) 
if 0 < a < 1 and 0 < 6 < 1 then 
report ( x\ , x'2 , x'3 , x 4 ) 
return 
end if 
next t 

end algorithm 


Figure 15.73: Edge-edge proximity test. 


15.9 Previous Work 

Bounding volume hierarchies have been around for a long time. Consequently, there is a huge wealth of 
literature on bounding volume hierarchies. Most addresses homogeneous bounding volume hierarchies 
and top-down construction methods. A great variety of different types of bounding volumes has been 
reported: spheres [Hubbard, 1996, Palmer, 1995, Dingliana et al., 2000], axes aligned bounding boxes 
(AABBs) [Bergen, 1997, Larsson et al., 2001], oriented bounding boxes (OBBs) [Gottschalk et al., 1996, 
Gottschalk, 2000], discrete orientation polytypes (k-DOPs) [Klosowski et al., 1998, Zachmann, 1998], 
quantized orientation slabs with primary orientations (QuOSPOs) [He, 1999], spherical shell and swept 
sphere volumes (SSVs) [Krishnan et al., 1998, Larsen et al., 1999]. In general, it has been discovered that 



Figure 15.74: Self-intersection needs special treatment of neighboring faces, since neighborship implies 
self-intersection along common edges. 
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there is a tradeoff between the complexity of the geometry of a bounding volume and the speed of its 
overlap test and the number of overlap tests in a query. 

In contrast to bounding volumes types, only a little has been written on approximating bounding 
volume hierarchies. To our knowledge, [Hubbard. 1993] pioneered the field, where octrees combined with 
simulated annealing were used to construct a sphere free. In [Palmer et al., 1995, Palmer, 1995], a superior 
bottom-up construction method was added, which is based on medial surface (M-reps) [Hubbard, 1996]. 
More recently, [O’Sullivan et al., 1999, Dingliana et al., 2000] used approximating sphere-trees built in a 
top-down fashion based on an octree for time critical collision detection, and [Bradshaw et al., 2004] used 
an adaptive m-rep approximation based on a top-down construction algorithm. 

Even less has been written about heterogeneous bounding volume hierarchies, although object hi¬ 
erarchies of different primitive volume types arc a widely used concept in most of today’s simulators 
[ODE, 2005, Vortex, 2005, Karma. 2005], The SSVs [Larsen et al., 1999] is one of the most recent pub¬ 
lications. The general belief is, however, that heterogeneous bounding volumes do not change the funda¬ 
mental algorithms, but merely introduce a raft of other problems. It is also believed that heterogeneous 
bounding volumes could provide better and more tightly fitting bounding volumes resulting in higher 
convergence toward the true shape volume of the objects. This could mean an increase in the pruning 
capabilities and a corresponding increase in performance. 

Most of the work with bounding volume hierarchies has addressed objects that are represented by 
polygonal models. Many experiments also indicate that OBBs (and other rectangular volumes) provide 
the best convergence for polygonal models [Gottschalk et al., 1996, Gottschalk, 2000, Zachmann, 1998, 
Larsen et al., 1999], while spherical volumes are believed to converge best toward the volume. The un¬ 
derlying query algorithms for penetration detection, separation distance, and contact determination of 
bounding volume hierarchies have not changed much. In their basic form, these algorithms arc noth¬ 
ing more than simple traversals [Gottschalk, 2000], The stack-list traversal [Zachmann et al., 2003a] is 
thread-safe and overcomes the problem of inefficient updating of BVs, meaning that queries can be done 
in parallel even though they share the same BVH data structure. 

M-rep-based methods arc the state of art for bottom-up construction methods [Hubbard, 1996] and 
top-down construction [Bradshaw et al., 2004], For deformable objects such as cloth, bottom-up construc¬ 
tion based on mesh topology [Volino et al., 1995, Volino et al., 2000, Bridson et al., 2002] is the preferred 
choice. In [Bergen, 1997], a median-based top-down method was proposed for building an AABB free. 
[Larsson et al., 2001] suggested using a mesh connectivity tree in a top-down construction method. Ex¬ 
tensions to deformable surfaces by curvature test are discussed in [Volino et al., 1995, Volino et al., 2000], 
In [Larsson et al., 2003] an update method for morphing deformation based on blending is proposed. The 
method out performs their hybrid update method [Larsson et al., 2001], but is not as generally applicable. 

In [Bergen, 1997], initial AABB trees are built in local object coordinate systems. The trees arc 
built using classical top-down splitting methods, and the entire AABB free is updated in a bottom- 
up fashion ensuring that making parent AABBs enclose their children AABBs. During runtime, the 
AABB trees are moved causing the local defined AABBs to become OBBs. A variation of the sep¬ 
aration axis overlap test method is used to deal efficiently with this problem. Self-intersections arc 
treated in [Volino et al., 1995, Volino et al., 2000] by organizing the boundary of an object into a hier¬ 
archical representation of subsurfaces, each of which has no self-intersections. The BucketTree algorithm 
[Hirota, 2002, Ganovelli et al., 2000] is, in our opinion, interesting because it takes a different approach to 
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updating a spatial data structure, which is easily extended to more complex scenarios. Instead of updating 
the spatial data structure at every iteration, a fast method is used to remap primitives into the spatial data 
structure. In [Larsson et al., 2001], a top-down method is used for building AABBs in order to handle 
deformation of arbitrary vertex positioning of meshes, and mesh connectivity is analyzed when splitting 
parents to children. Furthermore, in order to minimize the number of AABB updates, the method uses 
a top-down update approach while doing a tandem traversal. A bottom-up update method is used on the 
remaining subtree when at predefined depth. 

Even though there are several ways for constructing bounding volume hierarchies, there has been 
a tendency to use top-down methods. Some believe that bottom-up methods would be superior to top- 
down methods in the sense that smaller and tighter bounding volume hierarchies can be constructed. 
Most recent work with bounding volume hierarchies has focused on trying out new kinds of bounding 
volumes, figuring out better methods for fitting a bounding volume to a subset of an object’s underlying 
geometry, finding faster and better overlap test methods, and comparing homogeneous bounding volume 
hierarchies of different bounding volume types with each other. In order to improve performance of 
traversal algorithms, depth control, layered bounding volumes, caching bounding volumes, and shared 
bounding volumes have been studied. 

Depth control: In time-critical applications it can sometimes be beneficial to set a limit on the depth 
a traversal is allowed to proceed in a bounding volume hierarchy, thereby trading accuracy for 
performance [Hubbard, 1993]. 

Layered bounding volumes: Having recognized the tradeoffs between the complexity of the geometry 
of the bounding volumes and their overlap test speed, simpler geometries arc tried out first in order 
to get a quick rejection test. Spheres arc most commonly used [Gottschalk, 2000, Eberly, 2005b], 

Caching bounding volumes: Caching bounding volumes from previous invocations create witnesses that 
can exploit spatial and temporal coherence [Gottschalk, 2000, Eberly, 2005b], 

Shared bounding volumes: This changes a hierarchy from a tree into a DAG [Hubbaid, 1996]. 
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Feature-Based Algorithms 


A feature-based algorithm works directly on the features of an object. For polygonal meshes, the fea¬ 
tures arc the vertices, edges, and faces of the meshes. In other words, a feature is a geometric primitive, 
and we therefore classify a narrow-phase collision detection algorithm as being feature based whenever 
the algorithm works directly on the geometric primitives of the objects. This is a broad classification 
and there is a wealth of literature on feature-based narrow-phase collision detection algorithms, and 
many different approaches have been investigated: polygonal intersection [Moore et ah, 1988], Lin-Canny 
[Ponamgi et al., 1997], V-Clip [Mirtich, 1998b], SWIFT [Ehmann et al., 2001], and recursive search meth¬ 
ods [Sundaraj et ah, 2000], just to mention a few. 

16.1 CULLIDE 

The naive approach to the collision detection problem would be to take every geometric primitive and 
test it against every other primitive, which leads to a brute-force, all-triangle pair clipping algorithm, with 
complexity 0 (n 2 ). 

Quadratic complexity algorithms arc unattractive in physics-based animation; nevertheless, exhaustive 
pair testing is not dependent on spatial structures and is generally applicable to arbitrary complex shapes 
with all kinds of degeneracies, such as holes, open boundaries, flipped faces, etc. 

Exhaustive search algorithms make no real distinction between handling self-intersections and object- 
object intersections, leading to improvements investigated in the literature. 

To avoid the all pair testing, the ideas of broad-phase collision detection can be exploited; for in¬ 
stance, each primitive could be bounded by a single AABB followed by a sweep and prune algorithm (see 
Section 12.3) in order to quickly find potential overlapping triangle pairs. This approach appeal's to be 
superior, since the method should run an expected 0 (n ) time, and since it’s generally applicable to both 
deformable objects, open surfaces, and even triangle soups. However, it useless for the following reasons 
[Hirota, 2002]: We need to keep close-pair counters for counting the number of axes that two AABBs 
overlap, and the storage requirement is likely to be ()(n 2 ) due to the neighborhood relation of primitives. 
For volumetric meshes such as tetrahedra meshes, every tetrahedra inside an object shares triangular faces 
with at least four other neighboring tetrahedra. AABB boxes for volumetric meshes will thus overlap by 
this structural relationship, and a lot of false AABB overlaps will be reported. 

Graphics hardware may be used to speed up collision detection, and in [Govindaraju et al., 2003] 
C ULL IDE is proposed to prune primitives from a potentially colliding set (PCS) as an efficient prepro¬ 
cessing step prior to an exhaustive search. 

Initially, the PCS contains all objects. All elements of the PCS are then rendered in a two-pass algo¬ 
rithm, first in a forward order and then in reverse order. 

While rendering objects, visibility queries are done to remove elements from the PCS, which is known 
not to be colliding with any other elements in the PCS. After the second pass, fully visible objects are 
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removed from the PCS. This strategy is applied iteratively until no more changes arc made to the PCS. 
Hereafter, one can perform exact collision detection on the primitives that arc left in the PCS. 

Other algorithms based on stencil and/or depth testing also use graphics hardware to accelerate col¬ 
lision detection [Hoff III et al., 2001, III et al., 2002, Heidelberger et al., 2003, Heidelberger et al., 2004]. 
These algorithms require read back of either the stencil and/or depth buffers, which can be expensive on 
commodity graphics hardware. 

The CULLIDE algorithm is based on visibility queries, computed by image-space occlusion queries, 
which arc supported on most graphics hardware. Occlusion queries have low bandwidth, unlike buffer 
read-backs, and are therefore cheap to perform on commodity graphics hardware (for more specific details 
see [SGI, 2005]). An occlusion query returns the number of rasterized fragments, which passed the depth 
test. Setting up the prober depth test, it is thus possible to count how many pixels are occluded by other 
objects already rendered onto the screen. 

Given an environment of n objects, 0\, O 2 , ■ ■ ■, O n , each represented as a collection of triangles, we 
want to find out which objects overlap and compute the overlapping triangles. If an object 0, is not in 
the PCS, then O, does not collide with any object in the PCS. This property is used to prune object pairs, 
which need to be considered for exact collision detection. Given a set S of objects, we test the relative 
visibility of an object O with respect to S, by testing if any parts of O are occluded by S. If we rasterize 
all objects in S, then O is considered fully visible when all fragments generated by rasterization of O have 
a depth value less than the corresponding pixels in the frame buffer, where we assume that depth values 
increase as we go into the screen. If O were colliding with something in S, then some fragments must be 
behind those rasterized from S, when rasterizing O. This is not the same as saying that we always have a 
collision when fragments of O are found to be behind those from S. On the other hand, if we know that 
all fragments of O are in front of all those from 5, then O can not be colliding with anything in S. This 
observation leads to the following lemma: 

Lemma 16.1 (Noncolliding Object) 

An object O does not collide with a set of objects S, when O is fully visible with respect to S. 

We will use this lemma to prune objects from the PCS. The naive approach for this would be to test each 
object in the PCS against all remaining objects, however this leads to an 0(n 2 ) complexity. An efficient 
approach can be derived from the following lemma: 

Lemma 16.2 

For n objects, 0 1 ,..., O n , in the potentially colliding set (PCS) and two sets Si-\ = {Oi ,..., O,- 1 } 
and Si -|_i = {Oj_|_i,..., O n }. If object Oi is fully visible both w.r.t. Si -1 and 1 then Oj is fully visible 
w.r.t. the set S = Si -1 U Sj+i. Hence Oi is fully visible w.r.t. to all other objects in PCS 

We can use the Lemma 16.2 to prune objects from the PCS in linear time, by iterating over the objects in 
two passes. When object Oi is treated in the first pass, it is tested if O, is fully visible w.r.t. .Sj_ 1 . After 
having performed the test, we can add (), to Si-\ simply by rasterizing Oi to obtain S t = Si- 1 U Oi. Now 
we are ready to test O t+ \ against S t . In the second pass, the object order is reversed, implying that O, is 
tested against Si + 1 . Adding O, to 1 we get the set S, = ,S' i+ i IJ and we are ready to test object Oj_i 
for full visibility against S t . This two-pass algorithm is shown in pseudocode in Figure 16.1. If an object 
Oi were determined to be fully visible in both render passes, then we know that it is fully visible w.r.t. 
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algorithm reduce( PCS ) 
clearDepthBuffer() 
for each o in PCS do 

DepthTest( GREATER_EQUAL ) 

DepthMask( FALSE ) 

BeginOcclusionQuery () 
render o 

EndOcclusionQuery() 

fullyVisible(o) = (GetQueryObject(o)==NONE) 
DepthTest ( LESS ) 

DepthMask( TRUE ) 
render o 
next o 
reverse PCS 
clearDepthBuffer () 
for each o in PCS do 

DepthTest( GREATER_EQUAL ) 

DepthMask( FALSE ) 

BeginOcclusionQuery () 
render o 

EndOcclusionQuery() 

fullyVisible(o) &= (GetQueryObject(o)==NONE) 
DepthTest( LESS ) 

DepthMask( TRUE ) 
render o 
next o 

remove all fully visible o from PCS 
end algorithm 


Figure 16.1: Reduction of the PCS in CULLIDE. 


both Si -1 and ,5’ J+ 1 . Thus, according to Lemma 16.1 0, can be safely removed from PCS. The pruning 
algorithm has linear complexity in the number of objects in the PCS. 

It can be somewhat difficult to see through the logic of the depth test and depth mask operations in the 
algorithm listed in Figure 16.1, so we will present here a small example, where the PCS consists of five 
objects labeled a, b, c, d, and e (in that order) as seen in Figure 16.2. Notice that the depth values at the 
far and near clipping plane are 2 = 1 and z = 0 During the first render pass: 

• When object a is rendered, all z-values are one. Thus, the greater-than-equal depth test fails on all 
fragments being rendered, and the fully visible flag for a is set. Finally, a is written to the depth 
buffer, and the next object is b. 

• Object b lies in front of a, implying that its z-values arc all less than those currently in the z-buffer. 
Thus b is also set to be fully visible. 

• Finally, object c is rendered and set to be fully visible, since all z-values arc less than those stored 
in the z-buffer. 
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Far 


Near 



z = 1 


z = 0 


Figure 16.2: Small CULLIDE example. 


• When object d is rendered something different occurs: a small paid lies behind object c, so object d 
will not be set to be fully visible since some fragments will fail the greater-than-equal test. 

• Similarly, when rendering object e, some parts arc lying behind objects d, causing fragments to pass 
the greater-than-equal test. 

In the second pass, the objects are rendered in reverse order: 

• Object e will be found to not occlude anything, being the first object that is rendered. However, 
object e was determined as not being fully visible in the first pass, so no changes are made to the 
state of object e. 

• When rendering, object d fragments will pass the greater-than-equal test, no changes are made to 
object d as it was already determined to be not fully visible in the first pass. 

• Object c lies in front of anything else being rendered so far; since it was determined to be fully 
visible in both the first and second pass, it is declared to be a fully visible object. 

• A similar thing happens for object b. 
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algorithm CULLIDE( Objects ) 

PCS = Objects 
do 

reduce(PCS) 

for each o in PCS do 

if o not primitive then 
remove o from PCS 
add subobjects of o to PCS 
end if 
next o 

while non primitive objects exist in PCS 
end algorithm 


Figure 16.3: Main loop of CULLIDE. 


• Object a lies in front of object 6, thus fragments will pass the greater-than-equal test and the state 
of object a is set to be non fully visible. 

When removing objects from the PCS only b and c were determined to be fully visible, thus a, d, and e 
arc left in the PCS. Performing an iteration more will prune object a, and we arc left with only object d 
and e, which will never be pruned from the PCS and in the end will be passed along for exact collision 
detection. 

To minimize the number of triangles that arc up for rasterization, a hierarchical decomposition of 
objects can be devised. Initially each object is enclosed by an AABB. Then the reduce algorithm is 
performed three times, using the three world axis as projection axes. Hereafter the AABB representation 
of the objects can be replaced by a set of subobjects, which arc used in the next reduction step of the PCS. 
Each time a reduction has been performed, objects in the PCS arc replaced by subobjects. Eventually 
objects in the PCS will be decomposed into their respective triangles. The finally output from CULLIDE 
will therefore be a set of potentially colliding triangles. 

CULLIDE could be initialized with each triangle initially being an object in the PCS, however this 
approach will have poor performance. It is beneficial to use a subobject to quickly home in on regions 
of objects with colliding triangles. Since a fully visible subobject may mean that several hundred or even 
thousands of triangles arc pruned in one reduction step. The subobjects can be computed dynamically on 
the fly. For instance, by collecting a subset of k triangles making up the object into a single subobject, 
another k triangles into another subobject, and so forth. 

To recap the idea is to iteratively continue decomposing objects in the PCS into subobjects, until all 
objects in the PCS consist of a single primitive, i.e., a single triangle. Figure 16.3 illustrates the idea in 
pseudocode. 

CULLIDE has many benefits, no precomputed data structures arc required, no topology information is 
needed, no temporal coherence is exploited. Thus, CULLIDE is applicable to both static and deformable 
objects—even flying triangle soups can be dealt with. Furthermore, the triangle count is also allowed to 
change from iteration to iteration. Many factors have an impact on the overall algorithm, object count, 
subobject strategy, number of triangles, projection direction used during the reduce algorithm, etc. We 
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refer the interested reader to [Govindaraju et al., 2003] for details on a performance analysis of CULLIDE. 
In [Boldt et al., 2004], the CULLIDE algorithm has been extended to handle self-intersections. There are 
however, serious performance issues that must be addressed. 

16.2 Optimal Spatial Hashing 

In the previous section, we saw how the brute-force of graphics hardware was exploited to speed up naive 
exhaustive searching to prune unneeded primitive pair tests. In this section, we will present optimal spatial 
hashing [Teschner et al., 2003]. This algorithm does not use graphics hardware; instead, spatial locality 
is exploited. The algorithm is designed for deformable objects represented by a tetrahedra mesh, but it 
can be equally well applied to a triangular mesh or soup. The conversion is straightforward and we leave 
this as an exercise for the reader, optimal spatial hashing uses a two-pass algorithm. First, vertices arc 
mapped into a hash map, and in the next pass, tetrahedras arc tested against the hash map to find potential 
colliding vertices. The novelty is that it is not the tetrahedra that arc stored into the hash map as is done in 
other gridding algorithms as suggested in BucketTree 
[Ganovelli et al., 2000], 

Given an infinite uniform rectilineal - grid in 3D space, characterized by a user-specified grid cell size, 
l, we can discretize any given point, r = [x, y, z] T , in space 
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The integer coordinate, p, can be thought of as a 3D index, mapping the point r into a uniquely determined 
index. A full 3D grid of cells would require a prohibitive amount of storage, so instead, we will use a ID 
hash map and a hash function, hash(-), which maps a discretized point into a ID index. The hash function 
suggested in [Teschner et al., 2003] is 

hash(p) = (xp\ xor yp 2 xor zpf) mod n, (16.2) 

where pi, p 2 , and p% are large prime numbers, e.g., 73856093, 19349663, and 83492791, as used in 
[Teschner et al., 2003], The value n is the hash table size, ideally also a prime number. Although in 
[Teschner et al., 2003], the hash table size varies according to 99, 199, 299, etc. There is no guarantee that 
discretized points mapped to two different grid cells will not map to the same hash cell. Traditionally, this 
is called a collision. 

The algorithm makes two passes; a first pass discretizes the position of each vertex of the tetrahedra 
using (16.1). The vertices are then stored into the hash cells using (16.2). In the second pass, the minimum 
enclosing AABB is found for each tetrahedron. This AABB is defined by two points in space, a minimum 
and maximum coordinate point. Using (16.1), a span of discretized points covered by the enclosing 
AABB is calculated. For each discretized point in this span the hash function is used to find the hash cell, 
containing vertices mapped into the same grid cell as the discretized point currently being treated. 

All nodes in the found hash cell are then tested for inclusion in the tetrahedron currently being treated. 
If a node is part of the tetrahedron, it is trivially rejected. However, if the node is not part of the tetrahedron 
currently being treated, and the position of the node is inside the tetrahedron, a contact point is reported. 
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class HashCell 
Timestamp T 
list<Node> nodes 

class HashMap 
Timestamp T 
vector<HashCell> cell 


Figure 16.4: Hash map data structure. 


To avoid dealing the entire hash table before each collision query, a clever time-stamping strategy can 
be applied instead. Each query is given a unique time-stamp; similarly, each hash cell has a time-stamp. 
When a vertex is mapped to a hash cell, it is first tested if the time-stamp is different from the current 
time-stamp of the query. If so, the hash cell is cleared before adding the vertex to the hash cell and the 
time-stamp of the hash cell is updated to that of the current query. In the second pass, one must make sure 
to only test against hash cells spanned by the AABB, which have a time-stamp equal to the current query. 

The point in the tetrahedron inclusion test is done by computing the barycentric coordinates of the 
point also used in Section 15.8. The barycentric coordinates, w i, W 2 , m 3 , and W 4 of the point r w.r.t. to 
the tetrahedron defined by the vertex positions p lt p 2 , p 3 ,and p 4 is given by 

r = wip 1 + w 2 p 2 + w.iP:i + u; 4P 4 ) (1.6.3) 

where 

w\ + w 2 + m 3 + W4 = 1, (16.4) 

which is trivially solved. Observe that one only needs to store three of the barycentric coordinates. The 
other is implicitly given by the last constraint. Having found the barycentric coordinates, the given point 
lies inside the tetrahedron if 


w\ > 0, W2 > 0, m 3 > 0 and w\ + W2 + m 3 < 1. (16.5) 

Here we assumed that m 4 was given implicitly by the other coordinates. Figure 16.4 illustrates the data 
structures in pseudocode. The first pass is shown in pseudocode in Figure 16.5. The second pass is given 
in pseudocode in Figure 16.6. The overlap algorithm is shown in Figure 16.7. 

The edge tetrahedron intersection case is ignored in the Optimized Spatial Hashing algorithm. At first 
hand, this might seem flawed. However, according to [Teschner et al., 2003], there arc two reasons for not 
considering edge intersections. First, the performance of the algorithm would decrease substantially, while 
the relevance of the edge test is unclear - in the case of densely sampled objects. Second, the algorithm was 
developed to be integrated into physically based environments, where penetrating vertices are easily dealt 
with, whereas edge intersections are uncommon and costly. 

Obviously, the first pass of the algorithm is linear in the number of vertices, n, since both the dis¬ 
cretization and the hash function takes 0(1) time, The second pass is more difficult to analyze. Clearly 
the number of tetrahedra is not going to be larger than the number of vertices, since four vertices are used 
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algorithm first-pass(H:HashMap,VrVertices) 
for each v in V do 

p = discretized(v.pos) 
h = hash-func(p) 

if H.cellfh].T not equal H.T then 
H.cell[h].nodes.clear() 

H.cell[h].T = H.T 
end if 

H.cell[h].nodes.add(v) 
next v 

end algorithm 


Figure 16.5: Pseudocode for the first pass. 


algorithm second-pass(H:Hashmap,T:Tetrahedra) 
for each t in T do 
A = AABB(t) 

m = discretized(A.min) 

M = discretized(A.max) 
for each p, m <= p <= M do 
h = hash-func(p) 
if H.cell[h].T equal H.T then 
for each n H.cell[h].nodes do 

if n not part of t and n inside t then 
create contact(n,t) 
end if 
next n 
end if 
next p 
next t 

end algorithm 


Figure 16.6: Pseudocode for the second pass. 


algorithm spatial-hashing(H:Hashmap,T:Tetrahedra, V:Vertices) 
inc H.T 

first-pass(H,V) 
second-pass(H,T) 
end algorithm 


Figure 16.7: Pseudocode for the spatial hashing algorithm. 
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to define each tetrahedra. If p is the average number of cells intersected by a tetrahedron, and q is the 
average number of vertices per cell, then the complexity is of order 0(npq). If the cell size is chosen to 
be proportional to the average tetrahedron size, then p is constant under that assumption that the aspect 
ratio of the tetrahedra does not vary too much. If there are no hash collisions, then the average number of 
tetrahedra per cell is constant and so is q, since there arc at most four times as many vertices as tetrahedra 
in a cell. Since p and q can be regarded as constants, the complexity of the algorithm is linear - in the 
number of primitives, that is, 0 {n). 

In [Teschner et al., 2003] numerical results are given supporting the following statements: 

• The method is linear in the number of primitives and not objects. 

• Hash table size influences performance; it should be large enough to avoid too many colliding 
entries in the hash map. 

• Grid cell size should be set to the average tetrahedra edge length to achieve the best performance. 

The problem of varying aspect ratio is not mentioned in [Teschner et al., 2003]. Most likely because it 
is not encountered in the test cases, where all tetrahedra appeal's to be of uniform size implying that the 
enclosing AABBs arc cubes of the same size, ideally suited for hashing in a uniform grid as pointed out 
by [Mirtich, 1996]. In [Mirtich, 1996], hierarchical hashing is used to circumvent the problem of large 
variance in aspect ratio. 


16.3 The Voronoi-Clip Algorithm 

To avoid the all triangle-pair exhaustive search comparison feature, tracking algorithms have been pro¬ 
posed, such as Lin-Canny [Lin et al., 1994, Lin et al., 1998], Voronoi-Clip [Mirtich, 1998b], and SWIFT 
[Ehmann et al., 2000a, Ehmann et al., 2000b]. In the following we will present the Voronoi-Clip ( V-Clip) 
algorithm as an example of a feature tracking algorithm. 

The main idea of V-Clip is to iteratively optimize either the minimum distance between two convex 
polyhedra or improve upon the localization of the closest point between the two polyhedra. 

The minimum distance and closest points between the two convex polyhedra arc given implicitly by 
two features: a vertex, an edge, or a face, one from each polyhedra. V-Clip estimates the minimum 
distance and closest points by a feature pair. In each iteration of the algorithm, V-Clip updates the features 
of the feature pair with their neighbors to minimize the minimum known distance or localization of the 
closest points. The algorithm iteratively continues to update these features until the closest point or an 
interpenetration is found. 

Before we can go into details of the V-Clip algorithm, we will need to work out some geometric 
definitions that allow us to more clearly describe features and regions of space around a convex object. 
We will do this in Section 16.3.1. If the reader is familial - with Voronoi regions and the closest feature pair- 
theorem , it should be safe to skip directly to Section 16.3.2, where the V-Clip algorithm is explained in 
detail. 
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16.3.1 Definitions and Notation 

The V-Clip algorithm is concerned with features of a polygonal object that arc either a vertex ( V ), an 
edge ( E ), or a face (F). We consider each feature type to be a closed set, i.e., an edge will include its 
end-points and a face will include its boundary edges. Mathematically, we describe each of the feature 
types as follows: 

• A vertex V is a point v 6 M 3 . 

• An edge, E, is a line running from vertex O to vertex D, and we can parameterize it as follows: 

e(t ) = (d — o) t + o for 0 < t < 1 (16.6) 

• A face, F, is represented by its boundary edges and a face plane, P(F), which we can describe by 

p ■ n — w = 0. (16.7) 

Here p is any point in the face plane, n is the outward pointing face normal, and w is the orthogonal 
distance from the origin to the face plane. 

Note that we will be working with convex polyhedra and the boundary edges of a face will therefore form 
a planar convex polygon without any holes. Furthermore, no coplanar faces arc present in the polyhedra. 
If a mesh contains coplanar faces, then these must be merged into a single convex face. 

Definition 16.1 (Signed Distance Map for a Plane) 

We define the signed distance function from a point p to a plane Q to be 

Dq(p) = p ■ n — w, (16.8) 

where the plane unit normal is n and distance to the origin is w. 

Any given point p may be classified with respect to a plane Q using Dq as: 

Dq (p) < 0 p is in back of Q 
Dq(p) = 0 p is on Q 
Dq (p) >0 p is in front of Q 

The signed distance map can also be used to find the intersection between an edge, E, and a plane, 
Q. By substituting the edge parameterization into the definition of the signed distance map, an analytical 
expression for the signed distance map of any point along the edge may be derived using (16.6) as follows: 


D Q (e(t)) = D Q ((d - o)t + o) (16.9a) 

= ndt — not + no — w (16.9b) 

= n ■ dt — wt — n ■ ot + wt + n ■ o — w (16.9c) 

= (n ■ d — w)t — (no — w)t + (n ■ o — w) (16.9d) 

= (D Q (d) - D Q (o))t +D q (o). (16.9e) 
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At the intersection point between the edge and the plane, Dq{e{t)) = 0, which means that 


0 = (D Q {d) - Dq(o)) t + D q {o ), 

■ _ Dq(o) 

Dq(o) — D Q (d) 


(16.10a) 

(16.10b) 


Hence, we only need to compute the t- value corresponding to the intersection point with the plane as long 
as the edge is not parallel to the plane. For a given t-value, we know: 

t < 0 The staring point o is the closest point on the edge to the plane Q. 

0 < t < 1 Point of intersection is e(t). 
t > 1 The end-point d is closest point to the plane Q. 

Definition 16.2 (Edge-Plane Intersection) 

Given a plane Q and a noncoplanar parameterized edge 


e(t) = (d — o)t + o, 


(16.11) 


where 0 < t < 1, then the intersection point between the edge and plane is given by the parameter t, 


DqH 

Dq(o) - D Q (d) ’ 


(16.12) 


as long as 0 < t < 1. 

It is common practice to define a topological neighborhood relation between features such that only 
features in touching contact arc neighbors. We will, however, require a more strict definition of neighbor¬ 
hood. 

Definition 16.3 (Feature Neighborhood) 

The neighbors of a feature are defined to be 

Vertex all incident edges 

Edge the two incident faces and the two end-points 
Face all boundary edges 

In our definition of neighborhood relationship we have implicitly assumed that all polyhedra arc closed 
twofolds; that is, we have a watertight object surface, where each edge is shared by exactly two faces. 
Notice that the neighborhood relation definition is a symmetric definition, meaning that if feature A by 
definition is a neighbor of feature If then feature B is also by definition a neighbor of feature A. As 
implied, our neighborhood relation definition is a strong definition. A weaker definition could be made 
where faces and vertices arc also allowed to be neighbors. 

Our neighborhood relation definition is not more prohibitive than the weaker definition, in the sense 
that given two arbitrary features on the polyhedra surface, it is possible to walk from one feature to the 
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other, by going from neighbor to neighbor. The difference is that the path using the weak definition will 
be shorter in some cases. 

The concept of a Voronoi diagram can be extended to a polyhedral object: in a Voronoi diagram a cell 
(or region) of a site point delimits a subspace, where all points arc known to lie closer to the site than any 
other site. This kind of information is useful for locating sites closest to a given point. Polyhedral objects 
do not pose point sites, but features; the definition below therefore extends the Voronoi concept to include 
polygonal features. 

Definition 16.4 (External Voronoi Region) 

An external Voronoi region VR(X) of a feature, X, from a polyhedron, O, is defined as: 

The set of points outside O lying closer or just as close to X than to any other feature. 

It is important to notice that we only consider the space lying outside an object. Although it is possible 
to define internal Voronoi regions as well [Lin et ah, 1998], we will not need those for our treatment of 
V-Clip. 

Introducing a distance function D(p , X), which gives the minimum distance between a point p and a 
feature X, allows us to express the definition of an external Voronoi region in more mathematical terms 
as 

VR(X) = where D(p, X) = min {D(p, Xi)} and (16.13) 

We will now define the concept of Voronoi planes, which will turn out to be very useful for describing the 
boundary of the external Voronoi regions. 

Definition 16.5 (Voronoi Plane) 

A Voronoi plane separates the Voronoi regions between two neighboring features, X and Y. We define it 
to be 

VP{X, Y) = VR{X) n VR(Y) (16.14) 


Notice that any point p, which belongs to VP(A, Y), has the same minimum distance to both X and Y. 

It is easily seen from our definition of neighborhood relation and the fact that features are closed sets, 
that a convex polyhedron has two types of Voronoi planes : edge-vertex Voronoi planes and edge-face 
Voronoi planes. 

Definition 16.6 (Edge-Vertex Voronoi Plane) 

The edge-vertex Voronoi plane, VP(E, V) is the plane with normal parallel to u = o — d and with point 
v on it. 

Definition 16.6 is demonstrated in Figure 16.8. 

Definition 16.7 (Edge-Face Voronoi Plane) 

The edge-face Voronoi plane, VP(E, F), is the plane with normal orthogonal to the normal of F and with 
all points of E on it. 
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o 

Figure 16.8: Edge-vertex Voronoi plane. 

rVF n\ 



Figure 16.9: Edge-face Voronoi plane. 


Definition 16.7 is demonstrated in Figure 16.9. There are some properties of the two kinds of Voronoi- 
planes that are worth more investigation, since they will have an impact on the details of the V-Clip 
algorithm. 

An edge-vertex Voronoi plane, \P(E, V), divides space into two point sets: a set where all points 
definitely lie closer to E than to V, and another set where the points lie just as close to E as to V. This 
is shown in Figure 16.8. An edge-face Voronoi plane, VP (E, F ), also divides space into two point sets: a 
set where we will certainty know that all points lie closer to F than to E, and another set where we just 
do not know whether F or E is the closest feature. This is illustrated in Figure 16.9. By our definition 
of features, we have three different kinds of Voronoi regions: a vertex Voronoi region (VR(V)), an edge 
Voronoi region (VR (E)), and a face Voronoi region (VR(Fj). We will now give precise definitions of 
how these three kinds of Voronoi regions are represented by Voronoi planes. 

Definition 16.8 (Vertex Voronoi Region) 

The boundary of the vertex Voronoi region, VR( V), is given by the edge-vertex Voronoi planes, VP{Ei, V), 
between the vertex, V, and all incident edges, Ei, to V. We choose the convention that normals are 
pointing into the vertex Voronoi region. 

Definition 16.8 is demonstrated in Figure 16.10. 
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Figure 16.10: A vertex Voronoi region and the edge-vertex Voronoi planes defining its boundary. 

Definition 16.9 (Edge Voronoi Region) 

The boundary of the edge Voronoi region, VR(E), is represented by two edge-vertex Voronoi planes, 
VP(E. O ) and VP(E, D), and two edge-face Voronoi planes, VP(E, E/)s. We choose the convention that 
normals are pointing out of the Voronoi region. See Figure 16.11. 

Definition 16.10 (Face Voronoi Region) 

The boundary of the face Voronoi region, VR(F), is given by the edge-face Voronoi planes, VP(E t , E), 
where the EiS are the boundary edges of F, and the face plane of F. We choose the convention that 
normals are pointing into the Voronoi region. See Figure 16.12. 

In our precise representation of the three Voronoi regions, we have carefully stated the orientation of the 
Voronoi planes making up the boundaries between the Voronoi regions. This was done in such a manner 
that the normal direction of one region is not conflicting with the normal direction of neighboring Voronoi 
regions. 

We now have all the necessary geometric background to state the main theorem, which the V-Clip 
algorithm relies on. 

Theorem 16.3 (Closest Feature Pair) 

Let A and B be two convex polyhedra and let P be a feature from A and Q a feature from B. Now let the 
points p G P and q G Q be the closest points between P and Q. If 


p G VR(Q) and q G VR(P) 


(16.15) 
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Figure 16.11: An edge Voronoi region and the Voronoi planes defining its boundary. 


then no other points from A and B lie closer to each other than p and q. In which case we say that P and 
Q are the closest features between A and B. 

The proof of this theorem follows trivially from contradiction and the properties of the Voronoi regions 
(see [Mirtich, 1998b] for details). Figure 16.13 shows some examples illustrating the theorem. 


16.3.2 Overview of V-Clip 

V-Clip performs a greedy search for two features (A", Y) that fulfill the Closest feature pair theorem. 
Theorem 16.3. Throughout the algorithm, V-Clip keeps track of a current feature pair (A, Y). The 
current feature pair is an estimate for the closest proximity between the two polyhedra, especially the 
closest distance between the two features D{X,Y) is an estimate for the closest distance between the 
two polyhedra. In each iteration, V-Clip incrementally replaces either A or Y with one of their respective 
neighbors. The replacement is only done if the neighboring feature either decreases the currently estimated 
minimum distance or improves upon the localization of the closest points. 

If we have a feature pair (A, Y), and we are about to update A with Z, then 


(X,Y) 


(Z,Y) =f> 


D(Z,Y) < D(X,Y) 
D(Z,Y) = D(X,Y) 


Ifdim(Z) > dim(A), 
Ifdim(A) < dim(A), 


(16.16) 
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Figure 16.12: A face Voronoi region and the Voronoi planes defining its boundary. 


where dim(-) denotes the dimension of a feature. In other words, if dim(Z) is greater than dim (A), the 
minimum distance will decrease, if dim(Z) is less than dim(x) the minimum distance is unchanged, but 
the localization of the closest points between the polyhedra is improved. Figure 16.14 shows examples 
clarifying these statements. In Figure (a), a face F is updated with one of its boundary edges, E ', i.e., a 
lower dimensional feature; the minimum distance between the feature pairs remains unchanged, but we 
now have a better idea of where we should look for the closest point. In Figure (b) we have a vertex, 
V, making up the initial feature pair with E. By our neighborhood relation definition, the only possible 
updates would be to replace the vertex with one of its incident edges, E' or E". Hence, regardless of 
which edge is used to update the vertex, the minimum distance between the feature pair becomes smaller. 

Our definition of neighborhood relation guarantees that we never have a face-face feature pair, (F, F). 
Thus, we have only five possible states that the algorithm can be in as depicted in Figure 16.15. In the 
figure, the possible feature updates that can occur are shown. A dotted arrow indicates a lower dimensional 
update, and a solid arrow indicates a higher dimensional update. From the figure, it is intuitively clear 
that V-Clip will terminate, since each update aggressively improves the answer, and since there are only a 
finite number of features, so there can only be a finite number of updates. 
















(a) (b) 


Figure 16.13: Examples of the Closest feature pair theorem. In Figure (a) the closest points from edge P 
and vertex Q lie inside each of the other Voronoi regions, implying that P and Q arc closest features. In 
Figure (b) this is no longer true. 

16.3.3 Edge Clipping 

The core of the algorithm is based on the use of edge clipping to determine a neighboring feature with 
which to perform an update. An edge E is clipped against the Voronoi planes of a Voronoi region VR(V). 
Since E runs from origin O to destination D, it is possible to determine whether an intersection point with 
a Voronoi plane of V R (V) indicates that the edge is entering or leaving the Voronoi region. We will also 
remember the neighboring features causing the Voronoi planes that the edge intersect, when it enters and 
leaves the Voronoi region, V R (.V). 

If any intersection points arc found, these indicate possible updates. In Figure 16.16 a simplified 
conceptual drawing is shown clarifying the idea behind the edge clipping. As can be seen in the figure, 
the Voronoi plane intersection indicates that feature X should be updated with its neighboring feature Z 
and not W. Of course the update should only be performed if the intersecting feature either decreases 
the minimum distance or improves upon the localization. Eater we will present methods for testing these 
requirements. The edge clipping can thus be seen as a heuristic that guides the greedy search of possible 
neighboring features with which to update the current feature pair. 
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Figure 16.16: Edge Y is clipped against Voronoi planes of X: VP(Jf, W) and VP (X,Z), edge Y is 
intersecting Voronoi plane defined by feature Z , thus Z is a possible feature with which to update X. 


The actual edge clipping is nothing more than a simple Cyrus-Beck clipping [Foley et al., 1996]. Fig¬ 
ure 16.17 shows the edge-clipping algorithm in pseudocode. Notice that the sign of the distance on the 
end-points are used to determine whether the intersection point is an entering point or a leaving point. 
Observe further that the algorithm iteratively updates the best known entering and/or leaving point by 
keeping track of the parameter /-value for the intersection point. 

When an edge is tested against a Voronoi region, there can be four different cases: 

• Either the edge is fully enclosed inside all the Voronoi planes of the Voronoi region or the edge is 
partially enclosed, meaning that it is sticking out through at least one of the Voronoi planes. 

• It might also be the case that the edge is excluded due to lying entirely on the outside of a Voronoi 
plane. We call this simple exclusion. 

• The last possibility is that the edge lies outside the Voronoi region, but in such a way that it is not 
simply excluded. We call this compound exclusion. 


The four cases arc shown in Figure 16.18. The results from the edge-clip algorithm shown in Figure 16.17 
can be used to determine which of the four cases we arc dealing with. 


clip-edge =>■ 


tin — 0, t ou t — 1 1 Xj n — Nfyut — Hull 
0 V ti n <C t ou t V 1, Nin N Q ut 7# null 

tin — 0? tout — 1 ? jVj n — Nout y- null 


tin > tout 


If enclosed 
If partial enclosed 
If simple exclusion 
If compound exclusion 


(16.17) 
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algorithm clip-edge (E,X, S,tin = 0 ,t out = 1 ,Ni n = N out = £ ) 
for all P € S do 

N = neighbor feature of X that causes P 
do = signx^E' * D(P, E.O ), d D = sign Xl tE' * D(P, E.D) 
if do < 0 and do < 0 then 
N in = N out = N 

return false 
else if do < 0 then 
t = do/{do - d D ) 
if t > ti„ then 

tin = t, Nin = N 

if tin > tout, then return false 
end if 

else if do < 0 then 
t = do/(do — do) 
if t < tout then 

tout — 6 N ou t — N 

if tin > tout then return false 
end if 
next P 
return true 
end algorithm 


Figure 16.17: Edge-clipping pseudocode. 


Depending on the case, it is possible to devise a feature update strategy. In the case of a partial 
enclosed edge, we have two /-values, tj n and t out , indicating points on the edge, where a feature other 
than X might be closer to E. The possibilities arc as follows: either there is no intersection with a Voronoi 
plane, in which case the /-value corresponds to an end vertex of the edge itself, and which suggests that 
the edge itself should be updated with the vertex, or a /-value might correspond to an intersection between 
E and some VP (N,X), which may indicate that X should be updated with its neighbor, N, constituting 
the intersecting Voronoi plane. 

16.3.4 The Derivative Test 

Given a pair of /-values from the edge-clipping algorithm described previously, we will now present 
a simple test to determine whether the corresponding features will decrease the minimum distance or 
improve upon localization. 

Looking at the sign of the derivative of the distance function between E and X, D'(X, e(/)), the best 
feature to update X with can be determined as summarized in Table 16.1. The idea is that if the derivative 
of the distance function is negative, then the distance is decreasing, if the sign is positive, then the signed 
distance is increasing. Hence, to minimize the minimum distance we want to pick features where the sign 
of the derivative of the distance function is negative. In the case of full enclosement, there is no need to 
update X, but perhaps we need to update E. This is done by picking the end vertex of E with the smallest 
distance to X. The entire strategy of determining what feature to update based on the derivative of the 
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\ \ 


\ \ 

Simple Exclusion Compound Exclusion 

Figure 16.18: Four different cases for edge intersection with a Voronoi region VR(X) bounded by Voronoi 
planes: VP(X, N\ VP(X, N 4 ). 
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Sign 

Best Interval 

Update 

D'(X,e(t in )) > 0 
D'(X,e(t out )) < 0 

D'(X, e(t in )) < 0 and D'(X , e(t out )) > 0 

[0 ,t in [ 

\^ / OUt') 1 ] 

[tim t 0 ut\ 

J 1^ 

1 1 1 
tqtq'tq 


Table 16.1: Signed derivative tests and corresponding feature updates. 


algorithm derivative-test ( E , X , tm , t out, Am, N mt ) 
if A' not an edge 
<5 in = sign[D'(A, e(t in ]))] 

<5out = sign[A'(X, e(tout]))] 
else 

<5 in = sign[D'(Ai„,e(ti„))] 

<5out = sign[_D'(lV 0 ut,e(tout))] 
end if 

if Ain ^ null and 5m > 0 
return N m 

else if A out ^ null and <5 0 ut < 0 
return A ou t 
end if 
return A' 
end algorithm 


Figure 16.19: Pseudocode showing how to determine what feature X should be updated with. Before this 
test, E has been clipped against VR(X) or a subset hereof. 


signed distance map is called the derivative test. 

If A is a vertex, then an analytical expression of the sign of the derivative of the distance function 
D'(X , e(f)) can be derived 


sign[D'(V, e(t))] = sign[it • (e(t) - «)] 


(16.18) 


If X is a face then 


sign [D'(F,e{t))\ 


+sign[w • n\ If D(F, e(t)) > 0 
—sign[w • n] If D(F, e(t )) < 0 


(16.19) 


Fortunately, it turns out that the case where X is an edge is not needed. The derivative test is used to handle 
full enclosement and partial enclosement; the remaining two cases are simple exclusion and compound 
exclusion. In the case of simple exclusion, we can update with the feature causing the exclusion, except 
in the case of an (E. F) state, due to the edge-face Voronoi plane property, as shown in Figure 16.9. This 
property makes it difficult to determine whether the violating feature is in fact also the closest feature to 
the edge. Figure 16.20 shows the case of simple exclusion. The solution to the problem is to search the 
perimeter of the face in order to find the closest feature to the edge. In most cases of compound exclusion, 
we can reuse the derivative test for doing the update. The pseudocode in Figure 16.19 shows the details 
of the derivative test. 
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Figure 16.20: Face exclusion. Edge E is simply excluded by S i, S 2 , S 3 , S 4 , and Sr ,. A search must be 
done to find the closest feature S 3 . 

16.3.5 Dealing with Penetrations 

Penetration can only be detected in the (V, F) and (E. F) states of the V-Clip algorithm. In the (E, F ) 
case a penetration is detected if the edge intersects the face plane and the intersection point lies inside the 
edge-face Voronoi planes of the face Voronoi Region. The (V, F) state is not obviously dealt with, since 
a local minimum can cause confusion, as shown in Figure 16.21. From the figure it is clear that it is not 
enough to test whether the vertex lies beneath the face plane. If this was the case, then all face planes of 
the object must be searched. If a single face plane is found where the vertex lies in front as depicted in the 
right picture of Figure 16.21, then the current face is replaced with another face of the object where the 
signed distance to the vertex is positive and greater than the signed distance to any other face plane. 


16.3.6 The Internal States of V-Clip 

In the previous sections we have presented the reader with all the technicalities of the V-Clip algorithm. 
We have finally reached the point where the internal workings of the five states in Figure 16.15 can be 
described. 


16.3.6.1 The Vertex-Vertex State 

The vertex-vertex case is dealt with simply by applying a brute-force test using the Closest feature pair 
theorem. Theorem 16.3. That is, one vertex is tested for inclusion by the edge-Voronoi planes of the other 
vertex and vice versa. If a Voronoi plane is found violating the Closest feature pair theorem, the vertex 
is updated to the corresponding edge defining the violating Voronoi plane. The pseudocode is shown in 
Figure 16.22. 
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Figure 16.21: Local minimum problem of the (V, F) case; in the left and in the right picture V lies inside 
the Voronoi planes of F ; in the left image V is also clearly beneath the face plane, and it is sensible to 
conclude that a penetration has occurred. However, the same arguments apply to the right picture, but no 
penetration occurs here. 


algorithm vertex-vertex-state(Vi, V 2 ) 
for all P = VP (Vi, E) € VR(Vi) do 
if D(P, V 2 ) < 0 then 
Vi >-> E 

return continue 

repeat all of the above with Vi and V 2 interchanged 
return done 
end algorithm 


Figure 16.22: Pseudocode for the vertex-vertex state of V-Clip. 


16.3.6.2 The Vertex-Edge State 

In the vertex-edge state, the vertex is tested for inclusion of the Voronoi planes making up the edge- 
Voronoi region. If a violating plane is found, then the edge is updated to the neighboring feature defining 
the violating Voronoi plane. If the vertex is determined to lie inside the edge Voronoi region, then the 
edge is clipped against the Voronoi region of the vertex. Afterward, it is determined which one of the four 
edge intersection cases is being dealt with, and an appropriate action is taken as explained previously in 
Section 16.3.3 and Section 16.3.4. Figure 16.23 illustrates a pseudocode version of the vertex-edge state. 


16.3.6.3 The Vertex-Face State 

In the vertex-face state, it is first tested if the vertex lies inside the Voronoi places of the face Voronoi 
region. If the vertex is found to lie outside a Voronoi plane, a simple exclusion must have been found, 
and the face can be updated to the edge defining the violated Voronoi plane. If the vertex is found to lie 
inside all the Voronoi planes of the face Voronoi region, it is tested whether any of the incident edges of 
the vertex is pointing toward the face. If such an edge exists, going along it will decrease the minimum 
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algorithm vertex-edge-state (V, E ) 
for all P = VP{E,N) £ VR (E) do 
if D{P, V) > 0 then 
EieJV 

return continue 
end if 

clip-edge (E, V, VR(V), U„ , tout, A/m , A /out ) 
if E simply excluded then 
V He Nin 
else 

derivative-test (E, V, tin, tout, Ni n ,N out ) 
if V is updated then 
return continue 
else 

return done 
end if 

end algorithm 


Figure 16.23: Pseudocode for the vertex edge state of V-Clip. 


distance to the face. Thus the vertex is updated to the edge pointing toward the face. 

If all of the above tests pass, we know that neither the vertex nor the face can be updated, and a last 
test is performed where the signed distance of the vertex w.r.t. the face plane is computed in order to 
figure out if a separation has occurred, in which case a final state of the algorithm has been found. If the 
vertex is found to lie beneath the face plane, we can either have a penetration or a local minimum (see 
Figure 16.21). If we have penetration, we have reached a final state and the algorithm terminates. If a local 
minimum is detected, the face is updated with another face and V-Clip iterates once again. A pseudocode 
version of the vertex-face state is given in Figure 16.24. 

16.3.6.4 The Edge-Edge State 

The edge-edge state is specially compared with the other states. Here each edge is clipped against the 
other Voronoi region of the other edge, but the clipping is done in a two-pass manner. 

First an edge is clipped against the edge-vertex Voronoi planes of the other edge. These are known to 
be parallel, thus only a simple exclusion is done or a derivative test must be performed. This is illustrated 
in Figure 16.25. Notice that if E : > were fully enclosed, the derivative test would not perform any update. If 
no update occurs in the first pass, then the edge is clipped against the edge-face Voronoi planes of the other 
edge in the second pass. Again, either a simple exclusion is found or a derivative test must be performed. 
Figure 16.26 illustrates the idea in pseudocode. 

16.3.6.5 The Edge-Face State 

The last remaining state in the V-Clip algorithm is the edge-face state. Here the edge is first clipped against 
the Voronoi planes of the face. If either a simple or compound exclusion is found, a search along the face 



“book” — 2005/9/30 — 15:44 — page 550 — #562 


550 


CHAPTER 16. FEATURE-BASED ALGORITHMS 


algorithm vertex-face-stat e(V,F) 
for all P = VP (F,E) £ VR (F) do 
if D(P , V) < 0 then 
F E 

return continue 
end if 
P = plane(F) 

for all neighbors E to V do 

V' =endpoint of E not equal to V 
if D(P, F)sign [D(P, V ) - D(P, V ')] > 0 then 
return continue 
if D(P , V) > 0 then 
return done 
return localmin (V, F) 
end algorithm 


Figure 16.24: Pseudocode for the vertex face state of V-Clip. 



Figure 16.25: Figure showing that edge-vertex Voronoi planes of edge E\ arc parallel.Two things might 
happen when clipping another edge against these planes: either it is simply excluded as E 2 or it is not as 
£ 3 , in which case a derivative test is needed. 
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algorithm edge-edge-state(Ei, E2 ) 

S = edge-vertex Voronoi planes of Ei 
clip-edge (E2, E 1, S, tin , tout , Nin , IV out ) 
if E2 simply excluded by VP(Ei, V) then 
.Ei 1—* V 
else 

derivative-test (E2, E\,U n , tout, Nin,N ou t ) 
if Ei was updated then 
return continue 

S = edge-face Voronoi planes of Ei 

//Very important do not reset 1 -values and IV-values 
clip-edge (E2, Ei, S, tin , tout, Nin , IV out) 
if E2 simply excluded by VP(Ei, E) then 
Ei t—> F 
else 

derivative-test (E2, E\,U n , tout, Nin,N ou t ) 
if Ei was updated then 
return continue 

repeat all of the above withEi and E2 interchanged 
return done 
end algorithm 


Figure 16.26: Pseudocode for the edge-edge state of V-Clip. 


perimeter is initiated to determine the closest feature to the edge, and the face is updated to that feature. 
In the case that the edge is fully or partially enclosed, it is tested if the edge intersects the face plane, in 
which case the algorithm terminates in a penetrating state. 

If the edge does not intersect the face plane, then the sign of the derivative of the signed distance map 
is used to determine the course of action. This is done by testing whether the sign of the derivative of the 
signed distance indicates that the minimum distance decreases when one moves along the edge toward the 
face. If so, the edge is updated with the end-point that one is moving toward. Notice that if the minimum 
distance does not decrease, it means that the edge is partially enclosed and it intersects a Voronoi plane 
defined by a boundary edge of the face. The face is thus updated to the boundary edge. The pseudocode 
version of the edge-face state is shown in Figure 16.27. 

16.3.7 V-Clip in Perspective 

V-Clip is an improvement of the Lin-Canny algorithm [Lin et al., 1994]. Given the current feature pair, 
the Lin-Canny algorithm works by computing the actual closest points between one of the features and 
the neighboring features of the other feature. The other feature is then updated to the neighboring feature 
having a smaller distance to the closest point. V-Clip has three advantages over the Lin-Canny algorithm. 
V-Clip never computes the actual closest points between feature pairs or the distance between them. This 
makes V-Clip more robust and stable toward numerical precision and round-off errors. Furthermore, 
numerical experiments indicate that V-Clip is faster than Lin-Canny. The final benefit comes from V-Clip 
being able to handle intersecting objects. The original Lin-Cannny would enter an infinite loop if objects 
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function edge-face-state (E, F ) 

clip-edge (E,F, VR(F) - plane(F),t in ,tout, N in , N out ) 
if E excluded then 

F i-e closest edge or vertex on F to E 
return continue 
din = D(F, e(tin )) 
dout — D(F, e(t 0 ut)) 
if dindout < 0 then 

return penetration 
if D'(F, e(tin)) > 0 then 
if Nin A null then 
F I—7 Nin 
else 

E He E.o 

else 

if N ou t A null then 
F He N ou t 
else 

E He E.D 
return done 
end algorithm 


Figure 16.27: Pseudocode for the edge-face state of V-Clip. 


were intersecting, however, Lin-Canny were later extended with so-called internal pseudo Voronoi regions 
[Lin et al., 1998], allowing it to handle intersecting objects. A drawback of both algorithms is that they 
only work on convex polygonal objects. 


16.4 Recursive Search Algorithms 

The last feature-based algorithms, we will present are recursive search algorithms [Joukhadar et al., 1996, 
Joukhadar et al., 1998, Joukhadar et al., 1999, Sundaraj et al., 2000]. Recursive search algorithms search 
over the surface mesh to localize contacts between two objects. These algorithms typically exploit the 
topology of the faces in a mesh in order to perform a fast search for contacting faces given an initial 
colliding pair of faces. 

We will describe the algorithm presented in [Sundaraj et al., 2000]. The algorithm needs a pre¬ 
processing step, where another algorithm such as V-Clip, BVH, etc. determines an initial pair of colliding 
faces, these are passed along as input arguments to the algorithm. The algorithm then proceeds in three 
stages. First a contour of collision is found, then inner and outer boundaries arc determined, and finally 
the contact elements arc determined. 

The initial pair of colliding faces arc used to seed the first stage of the algorithm. A recursive search is 
initiated where all neighboring triangle pairs that arc colliding arc traversed in an iterative manner. When 
a colliding triangle pair is encountered during the search, it is added to a set called the contour of collision. 
The pseudocode in Figure 16.28 illustrates an iterative version of the recursive search. Observe that the 
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algorithm contour-of-collision (Fa, Fb :faces) 
list contour 
Queue Q 
Q . push ( Fa,F b ) 
while Q not empty 
(Fa, F b ) = Q. pop 
contour.add (Fa,Fb) 
mark Fa and Fb as contour faces 
for all non-contour neighbors Na of Fa 
for all non-contour neighbors N b of Fb 
if Na collides with N b then 
Q . push (Na, Nb ) 
end if 
next Nb 
next Na 
end while 
return contour 
end algorithm 


Figure 16.28: Recursive search method to find contour of collision. 


topology of the surface mesh is used to recursively trace a contour of colliding faces. Thus, only the exact 
intersecting boundaries of the two objects arc traversed. 

Hereafter, the second stage is initiated. In this state, neighboring faces of the contour of collision arc 
recursively visited and added to a set containing the inner boundary or another set containing the outer 
boundary. To test whether a neighboring face N, of a face A is paid of the inner or outer boundary, the 
face plane of the colliding face B of A is used. If N lies in front of the face plane of B, then N belongs 
to the outer boundary, if N lies behind the face plane of B, then it belongs to the inner boundary. The 
idea is illustrated in Figure 16.29. Here two faces A and B lying on the contour of collision are shown. 
The two neighboring faces N\ and AA of face A are tested against the face plane of face B. N\ is lying 
above the face plane of B and is thus added to the outer boundary, AA lies below the plane and is therefore 
added to the inner boundary. Care must be taken that neighboring faces are not visited more than once, 
since two faces lying on the contour of collision could share the same neighboring face. This is, however, 
easily avoided by flagging faces; also, faces already added to the contour of collision should be ignored. 
A pseudocode version of the second stage is given in Figure 16.30. 

In the last stage of the algorithm, contact elements are recursively searched. Initially we visit all the 
neighboring faces of the inner boundary. If these are not paid of the contour or the outer boundary, then 
they are added as contact elements. Afterward, their neighbors are recursively searched and so on, until 
all contact elements are found. Thus, the contact elements are those faces entirely embedded inside the 
surface of the other object. Figure 16.31 illustrates the idea in pseudocode. 

Looking at the complexity of the algorithm, we see that the first and second stages are linear in the 
number of intersecting faces, whereas the last stage is linear in the number of contact elements. Thus, the 
overall complexity is output sensitive. 

As explained, the algorithm is not addressing the problem of self-intersecting nor is the problem 
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Figure 16.29: Inner and outer boundary test. Neighboring faces of face A arc tested against face plane of 
face B. 


algorithm inner-outer-boundary(contour) 
list inner 
list outer 

for each pair of faces Fa,Fb in contour 
for all non-marked neighbors Na of Fa 
if Na in front of Fb then 
outer . add (Na ) 
mark Na as outer boundary 
else 

inner . add (Na ) 
mark Na as inner boundary 
end if 
next Na 

for all non-contour neighbors Nb of Fb 

... same as above with A and B interchanged 
next Nb 
next Fa,Fb 

return inner and outer 
end algorithm 


Figure 16.30: Search method to find inner and outer boundary of the contour of collision. 
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algorithm contact-elements(inner) 
list contact 
Queue Q 
Q .push(inner) 
while Q not empty do 
F = Q.popO 

for all neighbors N of F 

if N is not contact, outer, inner or contour then 
mark N as contact 
contact.add( N) 

Q .push ( N) 
end if 
next N 
end while 
return contact 
end algorithm 


Figure 16.31: Recursive search method to traverse contact elements. 


addressed in [Sundaraj et al., 2000], however the extension seems trivial. As noted earlier, the algorithm 
needs a preprocessing step feeding an initial pair of colliding faces. In [Sundaraj et al., 2000] this was 
done by using GJK on a convex decomposition of the objects. The result of the algorithm is a set of 
intersecting faces (the contour of collision) and a set containing faces fully embedded inside the surface 
of the other object. Thus the algorithm returns an explicit representation of the entire intersection volume 
between the two objects. It is not obvious how the algorithm can be extended in a general context to work 
with multiple contact areas. In [Sundaraj et al., 2000] every convex component of the decomposition was 
tested against each other to overcome this problem. 

Aside from the problem of seeding the algorithm with pairs of colliding faces from all contact areas, 
the algorithm has the advantage of being applicable to both static and deformable objects. 
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Volumetric representations of objects may be used for doing collision detection. Many algorithms, which 
we have chosen to classify as volume-based, combine the classical geometric algorithms treated in pre¬ 
vious chapters with the so-called image-based techniques or object space algorithms. As indicated, the 
image-based algorithms see objects as 3D images. Often signed distance maps arc used as these 3D im¬ 
ages. However, there arc also volume-based methods, which work on tessellated manifolds. The phrase 
“image-space techniques” refers to objects being rendered into an image (see [Govindaraju et ah, 2003]). 
We have therefore classified volume-based methods as methods which arc not limited to the boundary 
representation of objects. 

There is a large number of algorithms for volume-based methods, and we will therefore limit ourselves 
to present two different algorithms, which illustrates the idea of using a volume-based representation well. 
Most of the volume-based algorithms arc conceptually based on the same idea, but use different techniques 
for computing distance fields, layered depth images, etc. We do not see much point in delving into whether 
stencil buffers, GPUs, or OpenGL extensions arc used for producing the volume-based representations due 
to the rapid development in graphics hardware. 

17.1 Distance Maps 

Implicit functions arc useful for describing the shape of an object; consider the implicit function describing 
the surface of the unit sphere 

/unit sphere (p) — tt T !} 4 ~ Z 1 

This function has the property that given a point p = [x. y, z] T : 

/unit sphere (jp) <0 p is inside the unit sphere 

/unit sphere (p) > 0 p is outside the unit sphere 

/unit sphere (p) = 0 p is on the surface of the unit sphere 

This is beneficial since we have a way to determine in constant time whether an arbitrary point collides 
with the unit sphere. 

A signed distance map is a special case of an implicit function. The signed distance map, </>(•), is 
defined such that given an arbitrary point in space, p. the value o(p) gives the signed Euclidean distance 
to the closest point on the surface of the object. If the point p is outside the object the sign is positive. If 
the point is inside the object the sign is negative. In particular, if the signed distance is zero the point lies 
on the surface of the object. A signed distance map has the property that the magnitude of the gradient is 
one, and the gradient is normal to the object surface. Therefore, a contact normal can be found by using 
the gradient at the given point. 
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It is rare that one can find an analytical expression for the signed distance map for arbitrary convex 
objects. Even in those few cases where this might be possible, such an analytical expression will be far 
too expensive to evaluate. Instead, the signed distance map is represented by sampling it. A popular and 
widespread data structure for doing this is a rectilineal - grid. The object is embedded in this grid, and each 
node in the grid keeps the signed distance to the closest point on the object surface. The sampled signed 
distance function is called a signed distance field or signed distance map. 

Signed distance fields can be computed in many ways, e.g., the fast marching method [Sethian, 1999, 
Osher et al., 2003], characteristic scan conversion [Mauch, 2000], prism scan [Sigg et al., 2003], and dis¬ 
tance meshes [III et al., 2002, Sud et al., 2004] can compute signed distance maps on regular grids. How¬ 
ever, adaptive representations also exist [Frisken et al., 2000], where the signed distance map is stored 
in an octree representation in order to save memory at the price of the lookup cost in octree. In our 
description of the algorithm, for simplicity, we will assume that uniform grids are used. 

Given two objects A and B, the method in [Guendelman et al., 2003] works by taking the vertices 
of A and looking them up in the signed distance function of B. If a negative distance is found, then a 
contact point is generated. Afterward, the vertices of B are looked up in the signed distance map of A. 
In [Guendelman et al., 2003] a double representation of each object is stored and used to handle collision 
detection and contact generation. Each object is represented by a triangular mesh and a signed distance 
map. The method in [Guendelman et al., 2003] is tailored for rigid bodies, and thus a convenient object- 
space frame is set up where the center of mass is at the origin, and the axes aligned with the principal axes 
of inertia. Both the triangular mesh and the signed distance grid cj) are stored in this object-space. This 
means that when vertices of A are looked up in B, then they must be transformed from object space of A 
into object space of B, and vice versa. 

When a given point is looked up in the signed distance grid, it is very unlikely that the point will be 
perfectly aligned with a grid node, and in order to obtain a signed distance value at non-grid node positions 
an interpolation is therefore needed. We use a trilinear interpolation of the eight neighboring grid nodes. 

The algorithm as described is fairly easy to understand and straightforward to implement. One major 
drawback is that it ignores edge-edge intersections: if a coarse tessellation is given, such as 12 triangular 
faces for a cube, only the eight corners will be used to find contact points. This implies that no con¬ 
tact points will be found for the case of two cubes, one resting slightly rotated on top of the other. In 
[Guendelman et al., 2003] it is suggested to handle the problem by intersecting the edges with the zero- 
level set contour and flag the deepest penetrating point on the edge as a sample point. Furthermore, an 
edge-length threshold test can be used to determine which edges should be subjected to this more thorough 
test. 

Several acceleration methods can be applied to speed up a collision query. If more storage is allowed, 
one can store min-max-values and in-outside markers for the signed distance map grid. Thus, one will 
have a quick rejection test to avoid having to do the more expensive trilinear interpolation. This sort of 
extra information has an even more dramatic impact on octree-based grids, allowing a lookup to terminate 
at an early level in the hierarchy. 

The simplicity of the method implies that it is very cheap to test a single vertex for penetration, 
however, since a fine tessellation is needed a large number of vertices is expected, and looking up each 
vertex can degrade performance. To remedy this, vertices can be stored in a bounding volume hierarchy 
(see Chapter 15), where the BVH is tested for overlap with the box enclosing the distance grid. Only those 
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points found to be inside the distance grid box arc actually tested against the distance map. 

Another possible avenue for improving performance is through parallelism, where a single vertex test 
is completely independent of all other vertex tests, and they can thus be performed in parallel. 

A problem encountered in practice with highly tessellated objects is that they generate large contact 
point sets. Often we only want to use contact points at the corners of the convex polygon of a contact area 
between two rigid bodies. The method just described will in most cases generate contact points sampled 
uniformly over the entire contact area. A postprocessing or filtering step must thus be performed to prune 
unneeded contacts. Otherwise the performance of a typical constraint-based rigid body simulation will 
degrade horribly. 

One approach to contact filtering would be to divide contacts into disjoint sets based on their pene¬ 
tration distance and contact normals. That is, only contacts with the same penetration and contact normal 
direction within numerical threshold can be found in the same contact set. After having divided the con¬ 
tacts into disjoint sets, the convex polygon of the contacts in each set is found in a plane orthogonal to the 
common contact normal. All contacts not lying on the vertices of the convex polygon can now be pruned. 
It is possible to prune the contact sets further if a minimum size is wanted, see [Bouma et al., 1993] for 
more details. 

The method we have just described has another benefit from a contact generation viewpoint. Each 
contact is uniquely determined by the triangle mesh vertex that generated it. It is thus easy to uniquely 
identify the same contact point in two subsequent iterations of a simulation, which can be exploited to 
cache temporal information about contacts. For instance, to obtain smooth transitions when objects move 
over discontinuities, see Section 14.1. 

17.2 Layered Depth Image 

A graphics hardware accelerated approach is presented in [Heidelberger et al., 2003] based on computing 
a layered depth image ( LDI ) for the volume of intersection s(VoI) between two objects. 

An LDI of an object is very similar to a scan conversion of a polygon, see [Foley et al., 1996]. A 
scan conversion algorithm scans lines through a polygon, keeping track of when a line enters and when it 
leaves the polygon. These entry and exit points arc used to determine spans across the scan line, where 
pixels should be rasterized. An LDI is very similar, and parallel scan lines arc shot orthogonally along 
an image plane, and the depth of exit and entry points arc recorded in the LDI data structure. A volume 
representation of the Vol is thus created consisting of spans lying inside and outside the object. Figure 
17.1 illustrates the basic idea of an LDI. The algorithm proceeds in three stages: first, the intersection 
volume of the enclosing object AABBs is found. This intersection is itself an AABB if it exists; if it does 
not exist, there is no chance that the two objects collide. The intersecting AABB is called the volume of 
interest (Vol). The first stage is illustrated in Figure 17.2. Second, two LDIs arc computed, one for each 
object in the Vol. The LDIs can be computed efficiently using graphics hardware, to be elaborated on 
below. The LDI computation to a specific representation such as triangular meshes, patches, quads, etc., 
requires watertight object surfaces. Figure 17.3 shows the second stage. Third, the two computed LDIs 
arc used to determine whether a collision has taken place. This can be done in one of two ways. The two 
LDIs can be combined into one LDI representing the intersecting volume of the two objects. If no volume 
is found, it is known that the two objects arc not colliding. On the other hand, if a intersecting volume 
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Figure 17.2: First stage of the LDI algorithm, AABBs are used to find Vol. 
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Figure 17.3: Second stage of the LDI algorithm, LDIs of objects are computed in Vol. 

is found, the objects arc colliding and an exact representation of the overlapping parts of the objects is 
found. The overlap is found in a similar manner as the approach used in the previous section: first, the 
vertices of one object lying inside the Vol is looked up in the LDI of the other object. If the vertex lies in 
an inside span, then a collision has taken place. Second, the vertices of the other object is processed in a 
similar fashion. If a colliding vertex is found, it can be used to generate a contact point at the position of 
the vertex. In Figure 17.4 it is shown, how the LDIs in Figure 17.3 arc combined into an LDI representing 
the intersection volume. To guarantee proper boundary conditions, the LDI is rendered for each object 
from the outside. As shown in Figure 17.2, outside faces of object A arc opposite to outside faces of B, 
and we choose to render the LDI for each object from opposite outside faces, as shown in Figure 17.3. 
This guarantees that the LDI will start from an outside region. In particular, the first layer of points in the 
LDI will be entry points, the next exit points, and so on. 

If one of the objects is fully contained within the other object, the Vol will be equal to the AABB 
of the contained object, and it is thus impossible to find an outside face for the object with the enclosing 
AABB. The solution to the problem is to extend the Vol by picking one of the faces and extruding it until it 
touches the side of the enclosing AABB. This is illustrated in Figure 17.5. In [Heidelberger et al., 2003] it 
is suggested to pick the opposite faces with minimum distance in between, which works as a fast heuristic 
since the depth complexity, total maximum number of entry, and exit points along any scan line of the 
LDIs is minimized. 

17.2.1 Computing LDI Using OpenGL 

To generate the LDIs, we set up an orthographic projection. We use the selected outside face as the near 
plane and the opposite face as the far plane. The remaining faces define the top, bottom, left, and right 
planes of the view volume. The LDI is rendered using a multipass approach, where each object is rendered 
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Figure 17.4: Third stage of the LDI algorithm; the two object LDIs are combined. 

n m ax times, and where n max denotes the maximum depth complexity of the objects inside the Vol. This 
gives an 0(n max ) complexity. Depth testing, face culling, and color buffer arc disabled; only the stencil 
test are used. 

In the first render pass, only the first fragment is allowed to pass the stencil test, and thus write its depth 
value to the z-buffer. The subsequent stencil tests is set to fail, but still the stencil buffer is incremented. 
After the first render pass, the depth buffer contains an object layer in the LDI and the stencil buffer 
contains a map of the depth complexity. We can therefore find n max by reading back the stencil buffer and 
searching for the maximum value. The values stored in the depth buffer arc copied to the first layer in the 
LDI. 

Additional rendering passes arc then performed from 2 to n max - This time the stencil test is changed. 
The idea is that in the n’th pass, the first n — 1 fragments fail the stencil test, while the stencil buffer is 
incremented, and only the n’th rasterized fragment is allowed to pass the stencil test, writing to the depth 
buffer. Thus after the n’th render pass the n’th LDI object layer can thus be read back from the depth 
buffer. 

A pseudocode version of the LDI generation for a single object is shown in Figure 17.6. As can be 
seen from the pseudocode, the fragments generated in the LDI arc in arbitrary order, which means that 
the LDI must be sorted before it can be used. For each pixel, only the n p layers arc sorted, where n p is 
the depth complexity of that particular pixel. If ri p < n max then the remaining layers n p + 1 to n max arc 
ignored. 

Two simple optimizations arc possible: the depth buffer can be tiled, such that each layer has its own 
tile, and such that only a single read-back is necessary for the additional passes. In the first pass the 
stencil and depth buffers arc packed into a single buffer using the GL_NV_packed_depth_stencil 
extension such that only a single read-back is necessary in the first render pass. 

Compared to algorithms that compute a signed distance map of the intersecting volume such as 
[III et ah, 2002], the LDI based method has a lower bandwidth, since the LDI representation requires 
much less storage than a signed distance map. This is an advantage, since high bandwidth is an Achilles’ 
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Figure 17.5: Extending Vol to get outside faces of object A. 


Algorithm LDI(...) 

//First render pass 

glClear(GL_DEPTH_BUFFER_BIT or GL_STENCIL_BUFFER_BIT) 
glStencilFunc(GL_GREATER,1,OxFF) 
glStencilOp(GL_INCR,GL_INCR,GL_INCR) 
render object 

depth complexity = glReadBack(stencil buffer) 

Timax = max(depthcomplexity) 

layer[1] = glReadBack(depth buffer) 

// Passes from 2 to n m ax 
n = 2 

while n <= n m ax do 

glClear(GL_DEPTH_BUFFER_BIT or GL_STENCIL_BUFFER_BIT) 
glStencilFunc(GL_GREATER,n,OxFF) 
glStencilOp(GL_KEEP,GL_INCR,GL_INCR) 
render object 

layer[n] = glReadBack(depth buffer) 
n = n + 1 
end while 
End algorithm 


Figure 17.6: Pseudocode of LDI generation of an object. 
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heel in today’s commodity graphics hardware due to high read-back latencies. 

As described, the algorithm is capable of computing the intersection volume. However, if the vertex 
in LDI approach is used instead during the third stage of the algorithm, then the algorithm will suffer from 
missing edge-edge intersection, and highly tessellated objects arc then required to get realistic collision 
handling in simulation. 

In [Heidelberger et ah, 2003, Heidelberger et ah, 2004] it is explained how the algorithm can be ex¬ 
tended to handle self-intersections by keeping track of whether rasterized fragments stems from front¬ 
facing or back-facing faces. 

Compared to the signed distance map approach explained in the previous section, this algorithm is not 
as well suited for contact generation. The signed distance map inherently provides separation/penetration 
distances and contact normals, while with the LDI approach a postprocessing step must be performed to 
determine this information. 

17.3 Previous Work 

In [Hoff III et ah, 2001] a hybrid geometric method and image-based method is presented for doing 2D 
geometric proximity queries using graphics hardware. Here a geometric approach based on bounding 
volume hierarchies is used to quickly reject intersecting objects or to home in on overlapping regions. 
Signed distance maps arc then computed for the overlapping regions and intersecting pixels of two objects 
arc found by a multipass rendering scheme. Separation and penetration distances arc found simply by 
looking up distance measures in the signed distance maps. Gradients provide contact normals, penetration 
directions, and separation distances. The technique for computing the generalized Voronoi diagrams in 
[Hoff, III et ah, 1999] is used for computing the signed distance maps. However, the method is only for 
2D objects and the problem of self-intersecting deforming objects is not addressed. 

The work in [Hoff III et ah, 2001] was extended into 3D in [III et ah, 2002]. The method was also 
extended to handle self-intersections by applying the classical parity test used in the shadow volume 
algorithm. 

In [Hirota et ah, 2000] elastic bodies arc simulated using distance fields, a FEM is used where an 
initial precomputed signed distance held on an undeformed tetrahedral mesh is deformed as the simulation 
progresses. The idea is to find the tetrahedron enclosing a point p, given the barycentric coordinates, u \, 
it 2 , « 3 , and ii 4 ; of the point p the deformed distance is found by a linear interpolation of the undeformed 
distance values, d\, (h, c/ 3 , and c / 4 at the nodes of the enclosing tetrahedron, that is, 

d = u\d\ + U 2 d 2 + u 3 d 3 + u^d^ 

The barycentric coordinates arc found easily by solving 

p = U4P1 + U2P2 + U 3 p 3 + U 4 P 4 

subject to u\ + U 2 + u 3 + U 4 = 1. The method is applicable to both static and deformable objects. 
Furthermore, it handles self-intersection and interpenetration in a unified manner. However, several details 
arc not explained, such as how one determines which points should be looked up in which tetrahedra. 
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In addition, it is not discussed what to do about handling edge-face intersections (where there arc no 
penetrating points). 

The work is extended in [Fisher et al., 2001] where a lazy evaluation of AABB BVHs is used to 
quickly locate regions of interest (overlapping tetrahedra) before doing linear interpolation of the penetra¬ 
tion depth at the tetrahedra nodes as we described above. The deformed distance held is partially updated 
in the region indicated by the overlapping AABBs. 

In [Bridson et al., 2003] the work from [Bridson et al., 2002] is extended. Cloth environment colli¬ 
sions arc detected by looking up cloth mesh vertices in a signed distance map and the signed distance 
maps arc used for generating contacts between cloth and environment interactions. Signed distance maps 
of the static objects arc precomputed on regular grids. During simulation of the cloth, vertices in the cloth 
mesh arc looked up in the regular grids by trilinear interpolation. Gradients of the signed distance held 
provide contact normals. The cloth meshes arc highly tessellated and edge intersections (i.e., where end 
vertices do not intersect the static object) arc ignored. Furthermore, a fold-and-wrinkle preserving tech¬ 
nique is presented, which exploits the signed distance map to project interpenetrating cloth out of objects 
in the scene. 

In [Guendelman et al., 2003], a double representation of rigid objects is proposed for each object 
a mesh and a signed distance held is stored in the objects local coordinate system; during simulation, 
contacts arc generated by looking up mesh vertices from one object in the signed distance held of another 
object and vice versa. BVHs arc suggested for storing vertices and speeding up the collision queries. As 
with many other methods edge-intersections arc ignored and the method relies on having a hne tessellation 
of the rigid objects. 

In [Baciu et al., 2002] a hardware-assisted method for self-collision of deformable surfaces is de¬ 
scribed based on image-based intersections; triangle surfaces arc divided into so-called (tt, (3) surfaces, 
where tv is a unit vector, such that for any surface normal n, we have 7r • n > cos f3, where 0 < /3 < -k/2. 
The idea is now simply to render the surfaces along their 7r direction and test for intersection (compare 
this with Dehnition 15.3). 

In [Fuhrmann et al., 2003], a method is suggested for computing signed distance helds of deformable 
objects. A new sign function dehnition is introduced and used for open surfaces (such as cloth which does 
not have an inside-outside), collisions detection is performed by looking vertices up in the signed distance 
helds, edge intersections arc handled by introducing an offset (making vertices hover over the surface), 
and the edge midpoints arc used for proximity tests to avoid particles getting trapped in surface gaps. In 
our opinion, these methods arc ad hoc methods, which reduce the chance for getting in trouble, but they 
do not solve the problem of not handling the edge intersection case. 

In [Heidelberger et al., 2004] an image space technique is presented; it is based on layered depth 
images combined with face orientation information. The method handles both collision and self-collision. 
However, the paper never goes into detail about how to generate contacts (separation/penetration distance 
and normals) based on layered depth images. 

In [Hirota et al., 2001, Hirota, 2002], the undeformed distance held of a deforming object is used to 
define a continuous gap function. The distance held is referred to as material depth and is stored in the 
nodes of a tetrahedral mesh, which is used for FEM simulation of soft tissue. 
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Physics-based animation in computer graphics relies heavily on standard methods from mathematics, 
physics, and engineering. In Paid V we give thorough introductions to the main topics used from these 
fields in relation to physics-based animation in computer graphics. These chapters were actually our 
starting point for writing this book, and even though many of the topics arc expected to be standard 
knowledge for the intended reader, we have chosen to include this paid for the sake of completeness. For 
further reading, we recommend: 

Algebra, Calculus, Analysis, and Geometry: 

• [Adams, 1995] 

• [Bronshtein et al., 1997] 

• [Hansen, 1988] 

• [Koenderink, 1990] 

• [Kreyszig, 1993] 

• [Magnus et al., 1988] 

• [Messer, 1994] 

Numerical Analysis: 

• [Ames, 1969] 

• [Boor, 1978] 

• [Burden et al., 1997] 

• [Eriksson et al., 1996] 

• [Golub et al., 1996] 

• [Morton et al., 1994] 

• [Press et al., 1999a] 

Linear Complementarity Problems: 

• [Cottle et al., 1992] 

• [Murty, 1988] 

Classical Mechanics: 

• [Arnold, 1989] 

• [Boas, 1983] 

• [Goldsmith et al., 1987] 

• [Kleppner et al., 1978] 

Continuum Mechanics and Fluid Dynamics: 


[Lautrup, 2005] 
[Tritton, 1988] 
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Vectors, Matrices, and Quaternions 


This chapter will give a brief introduction to the notation we use for vectors, matrices, and quaternions. 
The notation is greatly inspired by [Weisstein, 2005b, Elbek, 1994]. We will assume that a Cartesian 
coordinate system is used unless otherwise and explicitly stated. 

18.1 Vectors 

A vector is a geometric tool with a head and a tail for the representation of length and direction, but not 
position. All parallel vectors with identical length arc thus geometrically identical. Vectors may be used to 
represent position by placing the tail at the origin of our coordinate system, but care should be taken since 
position and vectors translate differently: a position changes during a translation, while a vector remains 
unchanged. 

A vector representation is an ordered list of scalars, and an example of a vector is 


(18.1) 


where the scalars v, arc called the elements or components of the vector. We write that v is an n- 
dimensional vector as v £ M". A basic property of a vector is its length. 

Definition 18.1 (Vector Length ) 

The (Euclidean) length of the vector v E R n is 


IMI 2 = yvj + v% + --- + v%. 

A unit vector is a vector with length 1, and the zero vector, 0, has length zero. 

Vectors arc additive and have three different products: scalar-vector, dot, and cross products. 

Definition 18.2 (Vector Addition ) 

The addition of two vectors a. b £ M" is 


(18.2) 


a + b = 


a\ + b\ 
a 2 + b 2 

(In "T" Oil 


(18.3) 
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Definition 18.3 (Scalar-Vector Product) 

The product between a scalar k £ M and a vector v <G R n is 


kv = 


kv i 
kv 2 


kv n 


Definition 18.4 (Dot Product) 

The dot product between two vectors a. b <E M n , 


n 

a • b = a t bi. 
i= 1 


(18.4) 


(18.5) 


Definition 18.5 (Cross Product) 

The cross product of two vectors a, b € R 3 is 


a x b = 


d 2 h 
0.3b 1 
ai & 2 


«3&2 

ai&3 

a 2 6i 


(18.6) 


Two vectors arc perpendicular, denoted a 1 6, when a ■ b = 0, and they arc parallel, a || b, when 
a x b = 0. The dot product is proportional to the lengths of the vectors and the angle between them as 

a-h=||a|| 2 11 £> 11 2 cos 6*, (18.7) 


where 6 is the angle between them and the dot product of a vector with itself is naturally the squared 

II 11 2 

length of the vector, v ■ v = \\v\\ 2 . The length of a cross product is 


a x b || = a|| |61| sin0. 

(18.8) 

The dot product is commutative, 

a ■ b = b ■ a, 

(18.9) 

associative with scalars 

(ka) ■ b = k(a ■ b ), 

(18.10) 

and distributive with addition 

a ■ (b + c) = a ■ b + a ■ c. 

(18.11) 

The cross product is neither commutative, 

a x b = —b x a, 

(18.12) 

nor associative with cross product, 

a x (b x c) = b(a ■ c ) — c(a • b). 

(18.13) 


© 


© 


© 


© 



book 


2005/9/30 — 15:44 — page 573 — 11585 


18.1 PHYSICS-BASED ANIMATION 


573 


However, it is distributive with addition as 

ax (b + c) = a x b + a x c, (18.14) 

and associative with scalar products as 

(. ka ) x b = k(a x b) . (18.15) 

The cross product of a and b is perpendicular to both a and b. 

Definition 18.6 (Coordinate Systems) 

An n-dimensional coordinate system is a set of n vectors, e t 6 R n , i = 1... n, called the basis, and 
a zero point, called the origin. For a 3D space, the coordinates are commonly called i = x, y, z. The 
vectors must span the n-dimensional space, that is, all positions of the space must be reachable as a 
linear combination of the basis, 

x = a\e\ + a 2 e 2 H- \~ a n e n . (18.16) 


The coefficients a* are the representation of the vector a in the basis e t . When 

* 

1, ifi=j, 

0, otherwise , 

then the vectors constitute an orthonormal basis, and the coordinate system is called orthogonal. 


(18.17) 


Two nonparallel vectors and their cross product (a, b,a x b) make up a right-handed coordinate system. 
Right-handed coordinate systems arc related to determinants discussed in Definition 18.23. An often-used 
coordinate systems is the body frame coordinate system, which is placed and aligned conveniently with 
respect to an object. We denote a vector in coordinates in a body frame as a BF . In parallel, a single global 
coordinate system is almost always needed, and this is sometimes called the world coordinate system 
0 WCS),a wcs . 

A construction often appealing in vector equations is the space product. 

Definition 18.7 (Space Product) 

For a. b. c £ R 3 the space product is given as 


a x b ■ c = 


ai{b 2 c 3 - b 3 c 2 ) 
Q- 2 (b 3 c\ - b\c 3 ) 
a 3 (bic 2 - b 2 d) 


= det([a|6|c]) 


(18.18a) 

(18.18b) 


where det() is the determinant discussed in Definition 18.23 

The space product measures the volume of the parallelepiped spanned by the vectors a, b, and c, as 
shown in Figure 18.1. It is invariant under interchanging of the cross product and the dot product, and 
cyclic permutations of the vectors. 

In animation we often use vectors to express dynamic quantities, where we consider vectors as func¬ 
tions of time, a(t) : R —> R n and analyze their change with time. 
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Figure 18.1: Three vectors defining a parallelepiped. 


Definition 18.8 (Time Derivative of a Vector) 

For a time varying vector, v(t) : M —» M n , the derivative w.r.t. time is given as 


dv 

dt 


dv i 
dt 
dv 2 
dt 


dv n 

L dt J 


Definition 18.9 (Derivative of a Dot Product) 

The time derivative a dot product between two vectors a, b 


is 


d da db 

— (a ■ b) = — ■ b + a ■ —, 

dt v dt dt 


(18.19) 


(18.20) 


Definition 18.10 (Derivative of a Cross Product) 

Assuming that two vectors a and b vary with a variable t, then the derivative of their cross product is 

d , ,, da db _ 

— (axb) = — xb + ax—. (18.21) 

dt y ’ dt dt 

Derivatives of vector fields arc treated in Section 18.3. 

Definition 18.11 (Vector Norm ) 

A vector norm on M n is a function ||-||,/rom M n into M with the following properties: 

(i) ||r|| > 0 for r f 0, and ||r|| = 0 if and only if r = 0, 

(ii) ||o:r|| = |a| ||r||/or all r e M n and a e M, 

(iii) ||r* + y || < ||r|| + ||y||/or all r,y £ M n . 

Many norms arc possible, two such arc the I 2 and the norms. 

B—, , ,—© 
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Definition 18.12 (The I 2 and l !0a norms) 

The 1-2 and norms for the vector r = [ro, r\, ..., r n _ i] T are defined by 


2 



and llrll = max In 

00 0<i<n—1 


(18.22) 


The I 2 norm is called the Euclidean norm of the vector r. For a vector r £ M n the norms arc related by 


< 


\r \\ 2 < \/n\\r\ 


(18.23) 


18.2 Matrices 

A set of m linear equations in n variables may be written on the following form 

U\ = anVi + ai 2 n 2 H - 1- ai n v n (18.24a) 

U2 = 021^1 + 022^2 H - 1- Cl2nVn (18.24b) 


ttm — + d rn 2V2 T ' ' ' T Q'mn'Vnr 

where u l] arc the coefficients. This is written on matrix form as 


where A is the table of scalars a t j, 


u = Av, 



an 

a 12 ■ 

. . Clin 

(121 

a 22 


d?7ll 

Clm2 

• • CL rnn 


(18.24c) 

(18.25) 


(18.26) 


The scalars a tJ arc also called the matrix elements, and the matrix is said to have m rows and n columns, 
and we write A £ M mxn . A square matrix has m = n. A matrix with either m = 1 orn = 1 is called a 
row or column vector respectively. 

Definition 18.13 (Diagonal Matrix) 

A square matrix, D, is called diagonal if, and only if: 


dij = 0 when i f j. (18.27) 

That is, only the entries da, may be different from zero in a diagonal matrix. 

Definition 18.14 (Lower Triangular Matrix) 

A matrix, L, is lower triangular if, and only if: 


lij = 0 when i < j. 


(18.28) 


That is, there are only values below the diagonal. 
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Definition 18.15 (Upper Triangular Matrix) 

A matrix, U, is upper triangular if, and only if: 

Uij = 0 when i > j. (18.29) 

That is, there are only values above the diagonal. 


A strictly lower or upper triangular matrix is a lower or upper triangular matrix, which has zero diagonal 
as well. 


Definition 18.16 (Identity Matrix ) 

The diagonal matrix: 

/I 0 ...ON 

1 = ° 1 ! 

: 0 

V 0 ... 0 1 / 


(18.30) 


is called the identity matrix. Often in the literature, the identity matrix is denoted I. Unfortunately, in 
classical mechanics the I-notation is often used to mean an inertia tensor, which is why we use 1 for the 
identity matrix. 


Matrices are additive and have two matrix products defined. 

Definition 18.17 (Matrix-Matrix Addition ) 

The addition of two equal sized matrices, A,B £ M mxn , is 

C = A + B, (18.31) 

where Cij = a^j + bij. 

Definition 18.18 (Matrix-Scalar Multiplication ) 

Given a scalar k £ M and a matrix A £ M mxn , the product, 

B = kA , (18.32) 


is given as 

bij — kcLij. 

Definition 18.19 (Matrix-Matrix Multiplication ) 

Given two matrices A £ R mxn and B £ R oxm , then the product 


(18.33) 


C = BA, 


(18.34) 


is defined as, 

n 

Cij — ''f ’ bjpdpj. (18.35) 

p= i 

The result is the matrix C, where C £ M oxn . Its product is undefined, when the number of columns in B 
differs from the number of rows in A. The identity matrix 1 is the neutral element for matrix multiplication. 



“book” — 2005/9/30 — 15:44 — page 577 — #589 


18.2 PHYSICS-BASED ANIMATION 


577 


Definition 18.20 (Transpose Matrix ) 

The transpose of A is written as A 1 , and is found as 


B = A t 

(18.36a) 

4 


■£~5 

<3 

II 

-o 

(18.36b) 

Naturally, transpose is its own inverse, 

(A T ) T = A 

(18.37) 

But further, 

(A + B) r = A T + B t , 

(18.38) 

and 

( AB) t = b t a t . 

(18.39) 

for matrices A and B. 



Definition 18.21 (Inverse Matrix ) 

The inverse matrix of A € R nxn is written as A 
matrix 

A~ l A 


The inverse of a matrix product is, 

( AB r 1 = B~ 1 A~ 1 , 

(18.41) 

and further 

(18.42) 

Finally, 

AT 1 = (^') T 

(18.43) 

A square matrix, A , has an inverse if, and only if, det(A) 0. Common methods for calculating the 

inverse of a square matrix arc: Gauss-Jordan elimination, Gaussian elimination, and LU decomposition, 
which can be found described in [Press et al., 1999a]. The reader may also consult Chapter 19, where 
iterative methods for solving systems of linear equations arc described. 

Definition 18.22 (Trace of a Matrix ) 

The trace of a square matrix A 6 M nxn is given as 


n 

Tr(A) = y2 a n- 

i= 1 

(18.44) 

The trace is invariant under the transpose operator, 


Tr(A) = TV ( A t ) , 

(18.45) 


\ and the following property must hold for the inverse 
= A A" 1 = 1. (18.40) 
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it is associative with scalars, 


Tr (kA) = k Tr(A), 


for any scalar k <G M. The trace is further distributive w.r.t. addition, 


Tr(A + B) = Tr(A) + Tr (B), 


and invariant under multiplication, 

Tr(AB) = Tr(BA). 

Definition 18.23 (Determinant of a Matrix ) 

The determinant is only defined for square matrices. The determinant of A € M 2x2 is, 


A\ = det(A) 


= det 


Oil 

012 

021 

022 


on ai 2 

O 21 <322 


— Oll«22 — <321012- 


(18.46) 

(18.47) 

(18.48) 


(18.49a) 

(18.49b) 

(18.49c) 

(18.49d) 


The determinant for A G M nxn is defined recursively as 


Oil 

012 • 

Olra 


022 

023 • 

• • n 


021 

023 • 

■ • «2ra 

021 

022 • 

• • a 2n 

— On 

032 

033 • 

■ • CL3n 

- an 

031 

032 • 

■ • 03ra 

Oral 

On2 • 

• • Oran 


O n 2 

«ra3 • 

• • Q"nn 


Oral 

O n 3 • 

■ • Oran 


021 O 22 • • • 0 2 ( n _i) 
031 O32 ... 03(„_!) 


| ^nl ^n2 • • • ^n(n—1) 

n 

i= 1 


(18.50a) 


(18.50b) 


where Mij is the minor of matrix A produced by removing row i and column j in A. 

The determinant is commutative with inversion, 

det A’ 1 = (det(A)) -1 , (18.51) 

and it is distributive w.r.t. matrix multiplication, 

det(A_B) = det(A) det(.B), (18.52) 


when matrices A and B are invertible. 
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The determinant is a tool often used when solving linear systems of equations, and it has geometrical 
importance, since it is related to the area of parallelepipeds formed by two vectors u.v G R 2 is given as 
det([it|w]). In n dimensions, the volume of the parallelepiped spanned by the vectors v , v n is given 
as det([t?i| ... |u n ]), and the sign is the orientation of the parallelepiped. That is, in the usual three-space, 
the unit vectors e x , e y , and e z have a positive determinant in a right-handed coordinate system , that is, 
det([e :I .|e y |e,]j = 1, while a left-handed coordinate system has a negative determinant. These coordinate 
systems arc in mirror relation like the right and left hand as indicated by the difference in sign of the 
determinant. 

Definition 18.24 (Orthogonal Matrix) 

A matrix, R, is said to be orthogonal, if, and only if, 


R t = R 1 . 


(18.53) 


As a consequence, 


R t R= 1 


(18.54) 


As a consequence of Definition 18.24, the row or its column vectors form an orthonormal basis. 

theorem 18.1 (Rotation Matrix) 

An orthogonal matrix is a rotation (and possibly a mirroring) transformation. 

theorem 18.2 (Coordinate Transformation) 

Assume that we have two orthogonal coordinate systems labeled BF and WCS, and three unit vectors: 
r |. 7 ' 2 , r : > <E M", given with respect to WCS, and such that they are parallel with the axes of BF. That is, 
ri _L 7*2 -L 7 - 3 . We now construct a matrix R E R nx3 , such that its columns are equal to r\, r 2 , and r 3 . 
The R will transform a vector x given with respect to BF into the corresponding vector given with respect 
to WCS, that is, 


R = [ri I r 2 I 7*3] (18.55) 

Rx bf = x wcs , (18.56) 

where \r\ | 7*2 | 7 - 3 ] is the matrix produced by concatenating the three column vectors. The inverse, 

1 J ~R r T 1 T r T . r F 

R = R , will have rf, r£, and r;> as rows, and R will transform x given in WCS into the corre¬ 
sponding vector in BF, that is, 

(18.57) 


(18.58) 


R t = 


4 


R t x wcs = x 


„T' 


3 J 


BF 


where R 7 is produced by concatenating the three row vectors. 
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Proof of Proposition 18.2: 

Let’s start by examining R , where the columns arc given by r i, r* 2 , and r 3. Since R is a linear transfor¬ 
mation, it is enough to show that the axes of BF transforms into the axes of WCS. Let’s begin with the 
first axis of BF. 


Rr^ F = R 


1 

0 

0 


= [tt | r 2 | r 3 ] 


1 

0 

0 


= n 

_ ^WCS 
— r l 


(18.59a) 

(18.59b) 

(18.59c) 

(18.59d) 


Similarly, we find for the second and third axes, iirf F = r^ cs and Rr F] = r * cs . We have now proven 
the first half of our proposition. The column version of R is a rotation matrix, and the row version is its 
transpose, implying that it is the inverse rotation. This completes our proof. □ 


Definition 18.25 (Symmetric Matrix) 

A matrix A is a symmetric matrix, if it is equal to its transpose matrix, that is, 


A = A t . (18.60) 

For every matrix B, both B 1 B and BB 1 are symmetric. 

theorem 18.3 (Properties of a Symmetric Matrix) 

A symmetric matrix A has the property that it can be written as 

A = RDR t , (18.61) 

where R is an orthogonal matrix and D is a diagonal matrix. 

Proof of Proposition 18.3: 

We evaluate the transpose of (18.61) 

A t = (RDR t ) t 
= ( DR t ) t R t 
= ( R t ) t D t R t 

= rdr t , 


(18.62a) 

(18.62b) 

(18.62c) 

(18.62d) 


where we repeatedly used (18.39), and the fact that both the double transpose and the transpose of a 
diagonal matrix arc the identity operation. Since (18.61) and (18.62d) arc identical, we have proven the 
proposition. □ 
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Definition 18.26 (Skew Symmetric Matrix) 

A matrix A is said to be skew symmetric if its transpose is equal to its negation, 

A t = -A, (18.63) 

implying that the diagonal is zero. Every square matrix B can be decomposed into a sum of a symmetric 
and skew-symmetric matrix, 

B = ^(B + B t ) + (B-B t ). (18.64) 

theorem 18.4 (Cross Product Matrix ) 

The cross product between two three-dimensional vectors a and b can be written as a matrix product 
using the cross product matrix. The cross product matrix is defined as follows: 

( 0 a z (ly \ 

a* 0 -a x , (18.65) 

Cly CL X 0 J 

where we have 

a x b = a x b. (18.66) 

Proof of Proposition 18.4: 

By straightforward computation of the cross product between the two vectors a and b we get 

( dybz byCL z \ 

b x a z — a x b z = a x b. (18.67) 

&xby b X Ojy J [ ] 


theorem 18.5 (Properties of the Cross Product Matrix) 

The cross product matrix is skew symmetrical. 


Proof of Proposition 18.5: 

If we transpose the cross product matrix from Proposition 18.4, then we get 


a = 


and we have shown what we wanted. 


0 a z —a y 
CL Z 0 tX x 

try Oj X 0 


= —a 


(18.68) 

□ 


There arc several useful rules for the cross product matrix. We have listed some of them below: 

a x b = a T b x 

a x b = —b T a x 

a x b = b x a 


(18.69) 

(18.70) 

(18.71) 


It is not difficult to prove these rules by using Proposition 18.4 and Proposition 18.5. We will leave this as 
an exercise for the reader. 
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Definition 18.27 (Eigenvalues and Eigenvectors ) 

Given a matrix A. a scalar X, and a vector v, such that 

Av = Xv. (18.72) 

The scalar X is called the eigenvalue of A and v is called the corresponding right eigenvector of A. The 
similar equation 

u T A = Xu T , (18.73) 

define the left eigenvector. Left and right eigenvalues are equivalent, but this does not hold in general for 
left and right eigenvectors. Nevertheless, it is often sufficient to consider the right eigenvector, and it is 
thus referred to as the eigenvector. 

Letting V = [vi|v2| • • • \ v n[ be the matrix fanned by all the right eigenvectors of a matrix A E M nxn 
and D = diag(X\, X 2 , ■ ■ ■, X n ) be a diagonal matrix of corresponding eigenvalues, then we can write the 
full system of eigenvalues and vectors as 


AV = VD, (18.74) 

and equivalently for the left eigenvectors. 

U t A = DU t . (18.75) 

The left eigenvector is the right eigenvector of the transpose matrix A T , since 

u T A = Xu T , (18.76a) 

(u T A) T = (Xu T ) T , (18.76b) 

A t u = Xu. (18.76c) 

theorem 18.6 (Left and Right Eigenvectors of a Symmetrical Matrix) 

The left and right eigenvectors of a symmetric matrix are identical. 

Proof of Proposition 18.6: 

For a symmetric matrix, A = A T , hence by (18.76c) we realize that a left eigenvector is also a right 
eigenvector. Since the eigen system only has n eigenvalues and vectors, where A G R nxn , then we 
conclude that the left and right eigenvectors arc identical. □ 

theorem 18.7 (Orthogonality of Eigenvectors) 

Right and left eigenvectors are orthogonal on each other, that is ifv .i and Uj are right and left eigenvectors 
for a matrix A corresponding to eigenvalue A j and Xj, then 

Vi ■ Uj = 0, when i f j. (18.77) 
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Proof of Proposition 18.7: 

Stalling with (18.74) and (18.75) and applying the matrices of left and right eigenvectors we find that 


u t av = u t vd, 
u t av = du t v, 


(18.78a) 

(18.78b) 


and we conclude that (U 1 V)D = DiU 1 V). Since D is a diagonal vector, then so must U 1 V be. This 
completes our proof. □ 

Eigenvalues and eigenvectors arc related up to a scaling constant, i.e., if A and u arc corresponding eigen¬ 
values and eigenvectors solving (18.72), then cA and -u is also a solution to (18.72) for any constant 

cfi 0. 

theorem 18.8 (Eigenvalues and Eigenvectors of a Symmetrical Matrix) 

For a symmetrical matrix A we have: 

A = RDR t , (18.79) 

where R is an orthogonal matrix. Choosing eigenvectors of A to be unit vectors, then the columns of R 
will be the eigenvalues of A and the diagonal of D will hold the corresponding eigenvalues. 

Proof of Proposition 18.8: 

Assuming that U and V are the system of unit right and left eigenvectors of matrix A. Multiplying 
(18.74) with the left eigenvectors we find that 


avu t = vdu t . 


(18.80) 


Since A is symmetric we know that the left and right eigenvectors arc identical. Writing R = U = V we 
find. 


ARR t = RDR r , 
A = RDR t , 


since RR 1 = 1. 


Definition 18.28 (Positive Definite) 

A symmetrical matrix A is positive definite, if and only if 

v T Av > 0 for all v f 0 

theorem 18.9 (Properties of a Positive Definite Matrix) 

A positive definite matrix, A, can be written as 


(18.81a) 

(18.81b) 

□ 


(18.82) 


A = RDR t , 


(18.83) 


where R is an orthogonal matrix, and D is a diagonal matrix. Furthermore, all the diagonal elements of 
D circ strictly positive, da 0, Vi G [0 ... tt-]. 
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(a) (b) (c) 

Figure 18.2: Visualizing positive-definite, negative-definite, and indefinite matrices. 


When discussing the definiteness of matrices, it is useful to have the pictures in Figure 18.2 in mind. In 
the figure is shown the visualization of the function, 

/(r) = r T Ar , (18.84) 


for the following three matrices, 


Positive-definite A 
Negative-definite A 
indefinite A 


1 0 
0 lj ’ 

-1 0 ' 

0 -lj ’ 

1 0 ' 

0 -1 ' 


(18.85a) 

(18.85b) 

(18.85c) 


For the positive-definite, negative-definite, and indefinite matrices, there is a single minimum, maximum, 
and saddle point respectively at [0, 0] r . 


Proof of Proposition 18.9: 

The form of A follows directly from Proposition 18.3. In order to show the positivity of D, we will use 
Definition 18.28 and write 


v 1 Av > 0. 

(18.86) 

Using Proposition 18.3, we get 


v t RDR t v > 0. 

(18.87) 

Finally, with a little rewriting we have 


(R T v) T D (R t v) > 0. 

(18.88) 
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The expression R r v corresponds to a rotation of the vector v, so using w = R T v, we can rewrite our 
equation into 

w t Dw > 0. (18.89) 

We know that D is a diagonal matrix, and the equation can therefore be rewritten into 

dnwl 4-b d nn Wn > 0. (18.90) 

Since w is a rotation of the vector v with the property, 

vfiO, (18.91) 

then it must also hold that 

to / 0. (18.92) 

That is, there always exist one or more nonzero elements of w, Wi f 0. Therefore, we must have 

wf > 0 for all i E [1... n}. (18.93) 

The equation in (18.90) holds for an arbitrary vector w 0. Hence, choose w such that wj = 0 for all 
j E [1 ... (k — 1), (k + 1)... n\, then by (18.90) we conclude that dkk > 0, and this must hold for all 
(•//-/., k E [1 ... n\. This completes our proof. □ 

Definition 18.29 (Positive Semidefinite) 

A symmetrical matrix A is called positive semidefinite if, and only if 

v 1 Av > 0 for all v 0. (18.94) 

theorem 18.10 (Properties of a Positive Semidefinite Matrix) 

A positive semidefinite matrix A can be written as 

A = RDR t , (18.95) 

where R is an orthogonal matrix, D is a diagonal matrix, and all the diagonal elements of D are non¬ 
negative, d t i > 0; i E [1... n\. 

Proof of Proposition 18.8: 

The proof of this is almost identical to the proof of Proposition 18.9, and will be left to the reader as an 
exercise. □ 

theorem 18.11 (Elliptical Level Sets of a Positive Definite Matrix) 

Given a positive definite matrix M and a constant scalar c, then all vectors v that satisfy 

c = v T Mv, (18.96) 


will lie on an ellipse. 
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Proof of Proposition 18.11: 

Since M is symmetrical, there exists an orthogonal matrix R and a diagonal matrix D, such that 

c = v T Mv (18.97a) 

= v t RDR t v (18.97b) 

= (. R t v) T D (. R t v ) . (18.97c) 

The term R r v corresponds to a rotation of the vector v. Writing w = R T v, we get 

c = w t Dw. (18.98) 

Since D is a diagonal matrix, we may rewrite the equation as 

c = duw\ H-b d nn v? n . (18.99) 

Finally, since all da > 0, we realize that the equation represents an ellipse. □ 

theorem 18.12 (Ellipses as a Representation of Eigen Structure) 

Referring to the proof of Proposition 18.11, the elliptical structure, 



c = d\w\ + • • • 

+ d n Wn, 

(18.100) 

corresponds to the ellipse given by 

c = d\v\ + ■ ■ ■ 

+ d n Vn , 

(18.101) 

rotated by R r , and the axes are related as 




R 1 = [w i\w 2 \ 

... \w n ] 

(18.102) 


From this we note that the first axis corresponds to the first column of R, and it follows that the i’th non- 
rotated axis corresponds to the i’th column of the matrix R. Recalling that R consists of the eigenvectors 
of M, then we can conclude that the i’th axis of the ellipse, 

c = v t Mv, (18.103) 

is the i’th eigenvector of AT, and the eigenvalues are the scaling of the axes. 

Definition 18.30 (Matrix Norm) 

A matrix norm is defined for square matrices, A E R nxn and is a function || • |, from R nxn into R with the 
following properties, 

(i) || A|| > 0 for A E R nxn , A/0, and || A|| = 0 if, and only if, A = 0. 

(ii) ||aA|| = |a| \\A\\for all A E R nxn and a E R, 

(iii) || A + B\\ < || A|| + \\B\\for all A,B E R nxn . 
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(iv) ||AB|| < \\A\\\\B\\forall A,B eR nxn . 

Several useful norms exists such as the matrix p-norm, Frobenius norm, and the spectral norm. 

Definition 18.31 (Matrix p-norm) 

The matrix p-norm is defined as, 

IIAII = max IIArll (18.104) 

p ij r |T =1 "p 

II lip 

for any scalar 1 < p < oo. 

Definition 18.32 (Frobenius Norm) 

The Frobenius norm of a matrix A E M mxn is defined as, 


= \/Tr (AA t ) = 




EE 4- 

i=l j =1 


(18.105) 


Definition 18.33 (Spectral Norm) 

The spectral norm of a matrix A is given as the square root of the maximum eigenvalue of A T A, or 
equivalently, 



max 

p|| 2 #o 



(18.106) 


18.3 Scalar and Vector Fields 

A scalar field is a function defined in every point of space, / : M n —> M, e.g. f(x, y, z ) in 3D space. It is 
useful to think of a scalar field as a gray-valued image, where each position has a scalar value, the image in¬ 
tensity. A vector field and a matrix field is equivalently a vector and a matrix function defined in every point 
of space, e.g., the gradient vector of a scalar field, </> : M n —> W n , e.g. > 9 ^g’y ,Z ' > > d ^gf ,z ^ ] T - 

Fields often vary with time. 


18.3.1 Differential Forms 


Following [Magnus et al., 1988] we will calculate the derivatives of matrix equations using differentials 
also know as differential forms. The differential or total derivative of a function f(x , y, z ) is given as. 


df 


df df Of 
-f-dx + -fi-dy + -f-dz 
dx dy dz 


(18.107) 


where || etc., arc the partial derivatives and dx etc., arc the unit differentials. The differential may be 
thought of as a mapping from vectors into numbers. Pictorially, the differential is tangent where the 
partial derivatives jd etc. arc the tangent vectors, and the differentials dx etc., arc the coordinates. This is 
illustrated in Figure 18.3. 
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Figure 18.3: A function /, its tangent at xq, and the corresponding differential. 


Consider the simplest matrix function, 

F{x) = Ax, (18.108) 

where the vector x € W 1 is the vector field of Cartesian coordinates, and assume that A <G M mx " is a 
constant matrix independent of position. Then for every point * we conclude that the matrix function 
F : M n —► WL m . Expanding the terms we find. 


M x ) 

— anxi + CL 12 X 2 + • • 

+ di n Xn 

(18.109a) 

f 2 (®) 

= CI 21 X 1 + 0-22^2 + ' ' 

CL2n%n 

(18.109b) 

fm(x) 

= a m iX\ + a m 2 X 2 + • 

H - 

(18.109c) 


where X{ is the i’th coordinate for the position x in space. For each equation we now calculate the 
differential, and since x t is the coordinate function, then it only depends on the i coordinate, that is 


dxj _ I 1 if i = j, 
dxj 10 otherwise. 


Therefore, the differentials of each equation arc simply. 


(18.110) 


dfi = 
df 2 = 


dfm — 


andxi + ai 2 dx2 + ■ ■ 

-|- dxjidxyi 

(18.111a) 

a 2 \dxi + a, 22 dx 2 + • • 

~\~ d2 ndx n 

(18.111b) 

a ml dxi + ci m 2 dx 2 + • 

dmndXm 

(18.111c) 
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which we may summarize as, 

dF = Adx, (18.112) 

and in analogue to scalar derivatives we say that A is the derivative of F w.r.t. x. 

To find the partial derivatives of a matrix expression, we may thus calculate the differential of the 
expression and identify the leading term of the corresponding unit differentials. However, since the partial 
derivatives of matrix equations arc closely linked to the Taylor series, it is often simpler to calculate the 
derivatives as the linear paid of F(X + dX) — F(X), (see Chapter 20 for a discussion of Taylor series). 
The Taylor series to first order is given as, 

F{X + AX) = F(X) + —A1 + C>(||AX|| 2 ), (18.113) 

OJi. 

and taking AX —> 0 we identify the matrix as the derivative of F w.r.t. X. Again, for every position 
or matrix X this is simply the process of approximating the matrix function F with a linear function dF. 
The matrix coefficient 4^ is often called the Jacobian and is intimately related to the gradient operator to 
be discussed in the next section. The Taylor series gives us an interpretation of the total derivative as the 
change of F when we move in some specific direction AX. 

theorem 18.13 (Differential of a Constant Matrix) 

The differential of a constant matrix A is zero, 

dA = 0. (18.114) 

The proof follows immediately, since the Jacobian is zero. 

theorem 18.14 ( Differential of a Scalar-Matrix Product) 

Given k G M and a matrix X, then 

d(kX) = kdX, (18.115) 

and hence ^ynP~ = k- 

Proof of Proposition 18.14: 

We evaluate F(X + dX) - F{X) for F(X) = kX: 

k(X + dX) — kX = k dX, (18.116) 

and the linear term is trivially identified to be k. □ 

theorem 18.15 (Differential of a Matrix Sum) 

Given two matrices X and Y then 


d(X + Y) = dX + dY, 


and hence ^a = dix ^Q = L 


(18.117) 
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Proof of Proposition 18.15: 

We evaluate F(X + dX,Y + dY) - F(X, Y) for F(X, Y) = X + Y: 

(X + dX) + (Y + dY) - {X + Y) = dX + dY, (18.118) 

and the linear terms arc both trivially identified to be 1. □ 

theorem 18.16 (Differential of a Matrix-Matrix Product) 

Given two matrices X £ M mxo and Y £ M oxn , then 


d(YX) = (dY)X+ YdX, (18.119) 

and hence = ^ 

Proof of Proposition 18.16: 

We evaluate F(X + dX,Y + dY) - F(X, Y) for F(X, Y) = YX. 

('Y + dY)(X + dX) - ( YX) = (dY)X + Y dX + dY dX, (18.120) 


and the linear term for dX is identified to be Y, while the linear term for dY is more complicated, 
since matrix multiplication does not commute. There exists an extended set of operations to write such 
equations on a simpler form called the vec operator and the Kronecker product, but their definition and 
usage is outside the scope of this book. The interested reader is urged to consult [Magnus et al., 1988]. □ 

Lemma 18.1 (Differential of a Linear Transformation of Vectors) 

A simpler result than Proposition 18.16 is obtained for the product between a constant matrix A £ R mxn 
and a vector x £ M nxl ; 

d(Ax) = Adx, (18.121) 

since dA = 0, and the linear term is thus A. 


Lemma 18.2 (Differential of a Quadratic Form) 

From Proposition 18.16 we also easily derive the partial derivatives of a common quadratic involving 
matrices Y £ R nxn and x, Z £ M nxl : 

d(x T Yz) = (dx T )Yz + x T (dY)z + x 1 Y dz. (18.122) 

The linear term w.r.t. dz is x 1 Y. Furthermore, ( dx T )Yx is a scalar, and it is therefore invariant under 
the transpose operator. Hence, we may rearrange the first term to be x T Y J dx, and we identify the linear 
term w.r.t. dx to be x 1 Y 1 . Again we find that the linear term w.r.t. dY is difficult to write on a simple 
form, since matrix multiplication does not commute. Finally, in the specific case ofx = z, then the above 
simplifies to 

d{x T Yx) = x t (Y t + Y) dx + x T (dY)x. (18.123) 

theorem 18.17 (Differential of an Inverse) 

Given an invertible matrix X £ M nxn , then 


dX~ l = - 


X~ 1 (dX)X- 1 . 


(18.124) 
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Proof of Proposition 18.17: 

Since 1 = XX- 1 = X ~ x X we have, 

0 = dl = d{XX- 1 ) = ( dX)X ~ 1 + X dX- 1 , 

Isolating dX- 1 we find 

dx- 1 =-X~ 1 (dX)X~ 1 . 


theorem 18.18 (Differential of a Trace) 

Given a matrix X £ R nxn , then 

dTr(X) = Tc(dX). 

Proof of Proposition 18.18: 

We evaluate F(X + dX) - F(X) for F(X) = Tr(X): 

Tr(X + dX) - Tr(X) = Tr (dX). 


(18.125) 

(18.126) 

□ 


(18.127) 


(18.128) 

□ 


18.3.2 The Gradient Operator 


The gradient operator is an essential operator for fields, and it is used in computational fluid dynamics just 
as commonly as the derivative operator. We have therefore reserved a full section for it and its relatives. 
The symbol for the gradient is most often the Greek letter Nabla. 

Definition 18.34 (Nabla Operator) 

The nabla or “del” differential operator in three-dimensional space is defined as 


V = 


d d d 

e x~x —^ e y a —^ e -z W 
ox oy oz 


(18.129) 


where e x , e y , and e z are the coordinate axes using Einstein’s summation convention, we can write the 
nabla operator as 

V = ei |; (18.130) 

di 

where it is understood that the right-hand side is a sum over i, where i iterates over the names of the 
coordinate axes in the space the operator is used, i — x'' y'' z’. A final notation in 3D is 

d x = ^- = V, (18.131) 

ox 

where x represents the complete domain, e.g., for a 3D function x = \e x , e y , e z ] T or more commonly 
simplified to x = [x, y, z] T . If only parts of the domain are specified, e.g., V \ XiZ \t, then the nabla operator 
is restricted to the corresponding subspace, that is, 


V 


[x,z ] 7 



(18.132) 


This is not to be confused with a directional derivative discussed in Propositions 18.19 and 18.39. 
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Definition 18.35 (Gradient of a Scalar Field ) 

Applying the Nabla operator on a gradient field f gives the gradient vector field: 


V/ = V/, 


(18.133) 


which for f : M 3 —> K gives, 


V/ 


~ df(x,y,z) ~ 
dx 

df{x,y,z ) 
dy 

df(x,y,z) 

L dz J 


(18.134) 


The gradient denotes the direction in which the scalar field increases, and its length is equal to the deriva¬ 
tive of the scalar field in the gradient direction. Furthermore, 


C 


V/ • dr = ® V/ • t dr = 0, 


c 


(18.135) 


which states that the circular integral of the gradient operator on a closed curve, r G C, is zero, where t 
is the tangent to the curve. 


Definition 18.36 (Gradient of a Vector Field ) 

The gradient of a vector field, 0 : M n —> R m is 



'V0f 


" d<t> 1 
dx i 

dip i 

dx2 

dtpi ' 

dxn 

V0 = 


= 

dip 2 
dx i 

dip2 

dx2 

dip 2 

dxn 


VC. 


dlpm 

dlpm 

dlpm 



_ dxi 

dx2 

dXn . 


(18.136) 


which is also known as the Jacobian matrix. Notice that there is an inconsistency in the use of the gradient 
operator between the scalar and vector fields, where the former writes the gradient as a column vector 
and the latter as row vectors. This convention, used in most literature to reduce the number of transpose 
operations in equations involving the Jacobian [Cottle et al., 1992, Section 2.1.19]. When m = n = 3, 


we get 


'dip! 

dip! 

dipi ' 

dx 

dy 

dz 

dip2 

dip2 

d<p2 

dx 

dy 

dz 

dips 

dip3 

dips 

_ dx 

dy 

dz J 


(18.137) 


Sometimes only parts of the Jacobian are used, and a common notation is V r , where r is a vector of 
parameters to 0, e.g. r = [x l ,Xj,xf\ meaning 



- dip! 

dip! 

dip! - 


dxi 

dx. 

dx k 


dip2 

dip2 

dip2 

V r 0 = 

dxi 

dxj 

dx k 


dlpm. 

dlpm 

dlpm 


- dxi 

dxj 

dx k _ 


(18.138) 
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Definition 18.37 (Divergence of a Vector Field ) 

Applying the Nabla operator on a vector field <f> using the dot product gives the divergence scalar field: 


V • 0 = div 0, 


(18.139) 


which for 0 : R 3 


is given as, 


V-0 


<901 <902 , 903 

dx dy dz 


where 0 i, 02 , and 0 3 are the x, y, and z components of the vector fields, hence a scalar field. 
The divergence theorem or Gauss’ theorem is expressed using the divergence operator, 


(18.140) 


'V 


V ■ (f> dv = ® 0 • da = ® 0 -n da, 


(18.141) 


which states that the integral of the divergence of a vector field 0 over a closed volume v £ V is equivalent 
to the integral of flow through the surface a £ S, where n is the normal to the surface. In the limit of an 
infinitesimal volume, 


V 


= lim — 

v^o V 


0 • n da, 


(18.142) 


and thus it is seen, that the divergence is the point-wise flow through the surface of the point. 


Definition 18.38 (Rotation of a Vector Field ) 

Applying the Nabla operator on a vector field 0 using the cross product gives the rotation vector field: 


V X 0 = curl 0, 


which is also sometimes called rot. For 0 : R 3 —R 3 is given as 


~d<t> 3 

dip 2 " 

dy 

dfi 

dz 

dp 3 

4 

4 

dx 

dy . 


(18.143) 


(18.144) 


If the rotation of a vector field is zero, V x 0, then 0 is a gradient field, i.e., there exists a scalar field g 
such that 0 = V g. 

The rotation theorem or Stokes' theorem is expressed using the rotation operator: 


/ V x 0 • da = / V x 0 • n da = ® 0 • dl = ® 0 • t dl, (18.145) 

Js Js Jc Jc 

which states that rotation of a vector field along a closed curve 1 £ C is equal to the integral of the field 
over any surface a £ S, which has C as its border: The vectors n and t are the normal to the surface S 
and tangent to the curve C respectively. In the limit, 

n ■ (V x 0) = lim -!- ® 0 t dl, 

0 S Jc 

and we may interpret the normal part of the rotation of a vector field as rotation of the point. 


(18.146) 
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theorem 18.19 (Directional Derivative of a Scalar Field ) 

The change of a scalar field in the direction <fi may be written as 


df_ 

dcp 


(0-V)/. 


(18.147) 


When \cp\ = 1, then is called the directional derivative or ‘cp gradient /’. Assuming f(x, y, z ) : M 3 
M and <f) = [pi, p 2 , fif\ T G R 3 then 


df_ 

dcj) 


, df df df 

ox ay oz 


(18.148) 


Definition 18.39 (Directional Derivative of a Vector Field ) 

The change in a vector field 'ip, when moving along the vector field cj), is another vector field [(f) ■ V) ’if), 
which is called ‘(f) gradient if) 


dtp 

dtp 


(0 • v)-0 = (v</0 (f) 


(18.149) 


When \cj)\ = 1, then is called the directional derivative. For (f>,tf) : M 3 —* R 3 it is found that 


dip 

dcp 


[<h^+<h^+hm 


(18.150) 


In this case, the (p ■ V is applied component-wise on the elements ofxp. 


theorem 18.20 (Gradient of a Scalar-Scalar Product) 

The gradient of a (point-wise) product of two scalar fields, f and g, is a vector field given as 


V(/ 5 ) = <?V/ + fVg. (18.151) 

theorem 18.21 (Gradient of a Scalar-Vector Product) 

The gradient of a the (point-wise) product of a scalar field, f, and a vector field (p is 


V(/0) = /V<^ + 0(V/) T (18.152) 

theorem 18.22 (Divergence of a Scalar-Vector Product) 

The divergence of the (point-wise) product of a scalar field f and a vector field <p is a scalar given as 

V-(/0) = 0-(V/) + /(V-0). (18.153) 

theorem 18.23 (Rotation of a Scalar-Vector Product) 

The rotation of the (point-wise) product of a scalar field f and a vector field (p is a vector given as 


V x (/</>) = / (V x (f) + (V/) x <5 b . 


(18.154) 
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theorem 18.24 (Gradient of a Dot Product) 

The gradient of a (point-wise) dot product between two vector fields cp and ip is 
v(0 • Ip) = (ycp) T ip + ( Vip) T cp = (cp • V)ip + cp x (V x ip) + (ip • V)cp + ip x (V x 

theorem 18.25 (The Divergence of a Cross Product) 

The divergence of a (point-wise) cross product between two vector fields cp and ip is, 

V • (</> x ip) = (V x (p) ■ ip — <p ■ (V x ip). 

theorem 18.26 (The Rotation of a Cross Product) 

The rotation of a (point-wise) cross product between two vector fields <p and ip is, 


V x (ip x ip) = (V • ip) (p - (V • <p)ip + (ip -V) cp - {cp -V)ip. 


theorem 18.27 (The Divergence of a Gradient Field: The Laplace Operator ) 

The Laplace operator, 

V-V = V 2 , 

on a scalar field yields a scalar field, which for f : M 3 —> R gives 


V 2 f = ^/ 4. 

dx 2 dy 2 dz 2 


theorem 18.28 (The Rotation of a Gradient Field Vanishes) 

The rotation of a gradient of a scalar field is zero, 


V x (V/) = 0. 

theorem 18.29 (The Divergence of a Rotation Vanishes) 

The divergence of a rotation of a vector field is zero, 


V • (V x (p) = 0. 


theorem 18.30 (The Rotation of a Rotation) 

The Rotation of a Rotation is given as 


VxVx0 = V(V •</>)- V 2 0, 
where V 2 </> = [V 2 </>i, V 2 </> 2 , Y7 2 <pf\ T is used for shorthand. 

theorem 18.31 (Gradient of a Constant Linear Transformation of a Vector Field) 

The gradient of a vector field <p : R 3 —► R 3 under constant transformation A 6 R 3x3 is, 


595 


.). (18.155) 


(18.156) 


(18.157) 


(18.158) 


(18.159) 


(18.160) 


(18.161) 


(18.162) 


W(Acp) = AVcp 


(18.163) 
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Proof of Proposition 18.31: 

To prove we first expand the equation as 


V(A0) = V 


<21101 + 01202 + 01303 

O2101 + O22 02 + 02303 
03101 + 03202 + 03303 


0(an0i+ai202+ai303) 

0(an0l+ai2 02+ai303) 

9(an0i+ai2 02+ai303) 

dx 

dy 

dz 

d(a21<t>l+CL22(t>2+a23<i>3) 

d(a 2101 +d22 02 +«23 03 ) 

9{a 2101 +022 02 +«23 03 ) 

dx 

dy 

dz 

d(a$\ 01+C132 §2+a,33<f>3) 

C*(a.3101+a32 02+a3303) 

d (a.3101 + a32 02 + £133 03) 

dx 

dy 

dz 


Considering the first element we find. 


< 9(01101 + 01202 + 01303 ) 
dx 


<901 . <902 . <903 

®ii“S-h °i 2 ^- 1 - a 13 ——, 

ox ox ox 


(18.164a) 


(18.164b) 


(18.165) 


since the elements of A arc constant. Extending the calculation on the first element to the full matrix and 
separating the terms gives, 


V (A0) = 

an 

021 

a 12 

a22 

«13 

023 


-Si ^ -Ivl ^ 

i_ 

901 

dy 

002 

dy 

<41? <41? 

_i 


031 

032 

033_ 


_i 

003 

dy 

-1 

co 1^ 


= AV<f>. 


(18.166a) 

(18.166b) 


This completes our proof. 


□ 


theorem 18.32 (Directional Derivative of a Constant Linear Transformation of a Vector Field) 

The change in direction 0 : R 3 — > R 3 of a vector -0 : R 3 —> R 3 under constant transformation A £ R 3x3 
is, 

= (0 ' V)A0 = A ((0 • V)0) = A^. (18.167) 

Proof of Proposition 18.32: 

To prove, we write 


which completes our proof. 


(0- V)A0 


(V(A0))0 

(18.168a) 

(AV0) 0 

(18.168b) 

A((V0) 0) 

(18.168c) 

A (0 • V) 0 

(18.168d) 


□ 


theorem 18.33 (Gradient of a Constant Quadratic Form of Vector Fields) 

For 0, 0 : R 3 —r R 3 and a constant transformation A £ R 3x3 the gradient of the quadratic form cf) 1 A0 
is given by, 


V(0 T A0) = (V0) t (A0) + (V0) T A T 0. 


(18.169) 
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Proof of Proposition 18.33: 

Expanding the equation we find. 


V(0 T A0) 


v(0 • A0) 

(18.170a) 

(V0) t (A 0) + (V(A0)) t 0 

(18.170b) 

(V0) t A 0 + (AV0) t 0 

(18.170c) 

(V0) t A0 + (V0) t A t 0. 

(18.170d) 

□ 


In the special case where 0 = -0, we find that V(0 r A0) = (V0) 1 (A + A T )0. 


18.4 Functional Derivatives 

Functional derivatives arc a generalization of the derivative operator that occurs in calculus of variation. 
Instead of differentiation, a function w.r.t. a variable, a functional is differentiated w.r.t. a function. See 
Chapter 21 for details on the sense of functional derivatives. The essence is that in functional derivatives, 
it is allowed take derivatives w.r.t. function names. 

theorem 18.34 (Functional Derivative of a Scalar Field) 

Given a scalar field, f(x, u ), where x is the independent variable and u is the dependent variable, i.e., 
u{x). The functional derivative w.r.t. u is The functional derivative treats dependent variables as 
independent w.r.t. to differentiation. 

The above proposition is best understood with an example. 

theorem 18.35 (Functional Derivative of a Scalar-Scalar Product) 

Given two scalar fields, f,g, the functional derivative of their product w.r.t. a dependent vector field u is 

du(fg) = gd u f + fd u g (18.171) 

Proof of Proposition 18.35: 

Write the functional derivative using the specialized gradient operator as 

d u (fg) = 


Vu(fg) 

' d(fg) 

du\ 

djfg) 

dii2 

djfg) 

. dus J 

d(f) , n d(g) 
9(f) rdtfl 

& dus ' ^ du3 . 

gd u f + fd u g 


(18.172a) 

(18.172b) 

(18.172c) 


(18.172d) 

□ 
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theorem 18.36 (Functional Derivative of a Dot Product) 

Given two vector fields <f>, rp, the functional derivative of their dot product w.r.t. a dependent vector field 


u is 

d u (0 ■ if) = (d u (/)) T fi> + {d u fir) T 4> (18.173) 

Proof of Proposition 18.36: 

Assume that 0, fit £ M 3 , and that they are depending on u. then we expand the dot product as 

d u (0 • fit) = d u (4>i V’l + 0202 + 0303) (18.174a) 

= 0-u(0i0i) + 0^(0202) + d u (fi 3 f 3 ) (18.174b) 

= 0i0-u(0i) + 4>id u (fi) + 020u(02) + (t>2d u (i>2) + 0301003) + f 3 d u (i> 3 ) (18.174c) 

= {d u fi>) T fir + (d u fir) T <fi. (18.174d) 

This completes our proof. □ 

A special case is the functional derivative of a squared length, 

d u (0 T 0) = 2 (d u (j)) T <fi. (18.175) 


theorem 18.37 (Functional Derivative of an Independent Linear Transformation) 

The functional derivative of an independent transformation of a vector field <fi w.r.t. the vector field u is, 


0 U (A0) = A0u0 (18.176) 

Proof of Proposition 18.37: 

Assume that 0 : M 3 — ► M 3 and A G M 3x3 , then 


d u (A<fi) 


d„ 


anfii + ay 2 (t >2 + ai3</>3 
02101 + 02202 + 02303 
03101 + 03202 + 03303 


<9u(oil01 + 01202 + 01303) 
0 u(o 2101 + 02202 + 02303) 
0tt(a3101 + 03202 + O3303)_ 

0110^01 + Oi 20 it 02 + 0130^03 
02101401 + 02201402 + 0230-1403 
03101401 + O320U02 + 03301403 


Ad u fi>. 


(18.177a) 


(18.177b) 


(18.177c) 

(18.177d) 


This completes our proof. 


□ 


theorem 18.38 (Functional Derivatives of a constant Quadratic Form) 

Given two vector fields <0 and fir and an independent linear transformation A, the functional derivative 
w.r.t. the vector field u is 


014(00 A fir) = (dufir) 1 Afir + {d u fir) T A r fi> 


(18.178) 
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Proof of Proposition 18.38: 


d u {<P T A'tp) 


<9 U (0 • {Aip)) 

(18.179a) 

{d u <fi) T (A0) + {d u (Atp)) T cp 

(18.179b) 

{d u cp) T Aip + ( Ad u ip) T cp 

(18.179c) 

(d u cp) ri Atp + {d u tp) T A T cp 

(18.179d) 
□ 


In the special case, where 0 = 0, then d u {(p 7 Ac/)) = {d u cp) T ( A + A 7 ) 0. 

theorem 18.39 (Functional Derivative of a Normal Vector) 

Consider two vector fields cp and 0. If 0 is independent of ip then the functional derivative of their cross 
product w.r.t. 0 is 


d^f, (0x0) = —c0, (0 x 0) (18.180a) 

= -^(/ x 0 ) (18.180b) 

= -/ x c00 (18.180c) 

= -/ x . (18.180d) 


Likewise, iff) is independent of 0 //;<?/( the functional derivative of their cross product w.r.t. 0 is 

c0(0 x 0) = g x . (18.181) 

Now consider a normal vector field given by 

0X0 


//.s derivative w.r.t. 0 is 


d^n — ( 0 , 
= < 9-0 


n = 


0X0 


|0 X 0|' 


!!</> x 0ll 2 y 

0 X 0 


(0 x 0 • 0 x 0) 1 / 2 / 

(W X 0) ((0 X 0) T (0 X 0)) 


Using (18.152) we find 


d^n = ((0, (0 x 0)) (0x0) (0x0)) +(0x0) 


^ (0X0) (0 X 


(18.182) 

(18.183a) 

(18.183b) 

(18.183c) 


(18.184) 
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and using (18.180d) and (18.173) on the square brackets term, we get 

nq, = ~f x ({ip x 4>) t {ip x 
Simplifying by using Proposition 18.5 get 


2 ~ \{^ x <p) ({ip x cp) T {ip x 4>)\ 2 2 (f x f x ) T (ip x (p) 

(18.185) 


dq>n = -f x ( ( f xT 


/ x > - / 


/ xi >j ' {-f x ^p)[f xT f xT {-r^p) 

(18.186a) 

f x (Vf x f xT ^y~ 2 - [Vf x f xT ^y~ 2 f x ^ T f xT f x f x (18.186b) 


l T 


18.5 Quaternions 

A quaternion is a 4D complex number, which may be used to represent rotations in 3D space. A thorough 
treatment of quaternions is given in [Dam et al., 1998]. 

Definition 18.40 (Quaternion) 

A quaternion is defined to be a real number and three imaginary numbers. Typically a quaternion q is 
written as: 

q = s + xi + yj + zk, (18.187) 

where i, j, and k are the imaginary axes. Alternatively, the quaternion q may be written as 

q = [ S) v ] = [ S) ( x , y, z )] = [s, x, y, z} . (18.188) 

Definition 18.41 (Quaternion Product) 

Two quaternions q = [s, i>] and q' = [s', v'] are multiplied with each other as follows: 

qq = [ss' — v ■ v', sv' + s'v + v x v'~\ . (18.189) 

Definition 18.42 (Conjugated Quaternion) 

The conjugated quaternion of q = [s, v] is defined to be 

g * = [ Sj -t»]. (18.190) 

theorem 18.40 (Conjugate of a Product of Quaternions) 

For the conjugated of the product of two quaternions: q = [s, (x , y, z)\ and q' = [s', (. x ', y', z')\ we have 

(qq'r=q'*q*. (18.191) 
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Proof of Proposition 18.40: 

We will prove the equality by straightforward computation of the left-hand side and the right-hand side. 


From the left-hand side we have 

(qq')* = [ss / — v ■ v', sv' + s'v + v x v']* (18.192a) 

= [ss ; — v ■ v', —sv' — s'v — v x v'~\ , (18.192b) 

and from the right-hand side we get 

q'*q* = [ s ', -„'][s, -v] (18.193a) 

= [ss / — v ■ v', —s'v — sv' + v' x -y] (18.193b) 

= [ss / — v ■ v' , —s'v — sv' — v x v']. (18.193c) 

By comparison, we discover that the left-hand side is equal to the right-hand side. This completes our 

proof. □ 

Definition 18.43 (Inner Product of Quaternions) 


The inner product between two quaternions q = [s, (x, y, z)\ and q' = [s', (x', y', z')\ is defined as follows 


q ■ q' = ss' + v ■ v'. (18.194) 

Definition 18.44 (The Norm of a Quaternion) 

The norm (or length) of a quaternion is defined by 

IMI = \[q r q = \fqt- (18.195) 

theorem 18.41 (An Unit Quaternion is a Rotation) 

A quaternion, with norm equal to 1, is called a unit quaternion. If we have the unit quaternion, 

q = [cos 9, rising], (18.196) 

then it represents a rotation of 26 radians around the normalized axis n. 

theorem 18.42 (Quaternion Rotation ) 

Given p = [0, r] and a unit quaternion q = [cos 6, n sin 6], then the rotation of p is given by, 

p = qpq* (18.197) 

where p 1 is p rotated by 26 radians around the axis n. 

theorem 18.43 (Compound Rotations) 

Given p = [0,r] and two unit quaternions q and q', then the rotation of p first by q and then with q' is 
given by the quaternion 

r = qq 

The proposition states that composition of rotations is equivalent to product of quaternions. 


(18.198) 
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Proof of Proposition 18.43: 

From Proposition 18.42 we have: 

P = QPQ* 

now p' must be rotated by q', using the same proposition yields: 

// / / /* 
p = q p q 

= q'(qpq*W* 

= (q'q)p(q'q)* 

= r ,p r * 

which is what we wanted to prove. 


(18.199) 

(18.200a) 

(18.200b) 

(18.200c) 

(18.200d) 

□ 


theorem 18.44 (Conversion from Rotation Matrix to Quaternion) 

Given a rotation matrix, M, then the corresponding quaternion q = [s, (x, y, z)\ is given by: 


s 

x 

y 

z 


Mil + AT 22 + AT 33 + 1 
M 32 + M 23 
4.s 

M i 3 + M 3 i 
4s 

M21 + AT 12 
4s 


(18.201a) 

(18.201b) 

(18.201c) 

(18.201d) 


If the rotation angle is 7 r radians (180 degrees) then the value of the trace of the rotation matrix is — 1. 
This means that the value of s will be zero, and it will become impossible to compute the x, y, and z 
values. 

If the conversion is done on a computer then one would also have to consider the numerical inaccu¬ 
racies as the trace of the rotation matrix gets close to —1. 


theorem 18.45 (Conversion from Quaternion to Rotation Matrix) 

Given a unit quaternion q = [s, (x, y, z)\, the corresponding rotation matrix is: 

/ 1 — 2 (y 2 + z 2 ) 2 xy — 2 sz 2 sy + 2 xz \ 

M = j 2xy + 2sz 1 — 2(x 2 + z 2 ) —2sx + 2yz J (18.202) 

y —2sy + 2xz 2sx + 2yz 1 — 2 {x 2 + y 2 ) ) 

theorem 18.46 (The Time Derivative of the Quaternion) 

Given a body with an orientation represented by the rotation q(t 0 ), and an angular velocity ofu>(t), then 
the time derivative of the orientation is given by: 

q(to) = ^[0,w(t 0 )]g(to) (18.203) 

Using lo = [0,w(t 0 )], this can be written as: 

q = ^ujq (18.204) 
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Proof of Proposition 18.46: 

The angular velocity u>(t) means that the body rotates ||co(t)|| radians per second around the axis given 
by . After At seconds the body must be rotated by: 


r(t) 


co 


cos 


At u>(t) ||w 

■ sin — 


At 


co 


(18.205) 


Where At is some infinitely small value, such that we can assume that the angular acceleration is effec¬ 
tively constant. The orientation of the body after At seconds must then be: 


q(t 0 + At) = r(to)q(t 0 ) 


(18.206) 


If we substitute t = to + At and the expression for r(to), then we have the following equation for the 
orientation of the body: 


r IM*o)|| a (*-*o) w(t 0 ) . ll"(to )|| 2 (t~to) 

,( t ) = [cos-1-, R ^ r ...-§-]?(to) 


(18.207) 


Now all we need to do is to differentiate this equation with respect to time and remember to evaluate the 
final expression at time t = to (this is essentially the same as taking the limit of q(to + At) — (/(to) as At 
goes to zero). Doing so yields: 


m = ± [cos 


= dt [cos 


Mto)|| a (t-to) . ||w(t 0 )|| (t — to) co(t 0 ) w , c 

— 2 —— 2 — 

Mto)|| a (t-to) . ||w(t 0 )|| (t- to) co(t 0 ) 

2 -, sin- 2 


d_ 

dt 


cos ■ 


2 2 ||w(t 0 )|| 2 j 

w (to)|| 2 (t-t 0 )\ d(. ||w(t 0 )|| 2 (t-t 0 ) CO (to) 


q{to) 


’ dt 


sin • 


l w (*o)||. 


(18.208a) 
(18.208b) 
(/(to) (18.208c) 


l|w(t 0 )|L . ||co(t 0 )|L (t-to) Mto)|| 


■ sin ■ 


lk(to)|| 2 . ||w(to)|| 2 


■ sinO, ■ 


2 ’ 2 
|co(to)|| 2 CO (t 0 ) 


cos 0- 


c^(tp) 

|w(to)|| s 


2 IMt 0 )|| 2 (t-to) CO (to) 

- cos- l - 


|w(to)||. 


q (to) 


l|w(to)IL 


q(t o) 


= 2 [O,co(t 0 )]q(to) 


3 (to) 
(18.208d) 

(18.208e) 

(18.208f) 

(18.208g) 

□ 


This completes the proof. 
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theorem 18.47 (Quaternion Matrix ) 


where 


1 

2 urn = 

QiVi, 



- Xi 

~Vi 

-Zi 

l 

Si 

Zi 

~Vi 

2 

-Zi 

Si 

Xi 


Vi 

-Xi 

Si 


(18.209) 


(18.210) 


Proof of Proposition 18.47: 

Let us start by writing the quaternion product rule, for two quaternions q = [s, v] and q' = [s', v'], 


qq = [ss' — v ■ v', sv' + s'v + v X v'] . 

We will now use this product rule on the left side of (18.209), and we get 

1 1 r 

iQi = 2 

= [Os* - ui ■ Vi, 0 Vi + SiU>i + u>i x Vi\ 

= ^ [-Vi ■ U>i, SitVi - Vi X Ui\ 

_ 1 
~~ 2 



i 

05 

0 

1 

O 


T 

~ V i U>i> 

0 

Si 

0 

Vi - Vi* LVi 


1 

o 

0 

Si_ 



(18.211) 

(18.212a) 

(18.212b) 

(18.212c) 

(18.212d) 


We arc now almost finished. We only have to remember that v, = [x t , y t , z,\ and use the definition of 
the cross product matrix (18.65), this yields 


1 


:VM = 


1 

r 



2 

[-Xi 

~Vi 

~Zi 


-Xi 

~Vi 

-Zi 

1 

Si 

Zi 

~Vi 

2 

-Zi 

Si 

Xi 


Vi 

-Xi 

Si 


( 

05 

0 

o' 



0 

Si 

0 

— 

\ 

o 

0 

Si_ 



0 -Zi yi 
Zi 0 -Xi 
~Vi Xi 0 


UJi 


Vi 


(18.213a) 


(18.213b) 


Comparing this with the (18.210) for Q t we conclude that 


\= Qi^i- 


We have proved what we wanted. 


(18.214) 

□ 
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Solving Linear Systems of Equations 


Often in physics-based animation, we have to solve sets of linear equations. Popular sources for algorithms 
arc [Press et al., 1999a, Golub et al., 1996], which we follow in this chapter. 

A linear set of equations may be written as 


aim + ai 2 r 2 H-1- a 1N r N = h, 

a 2 rn + a 22 r 2 H-h a 2 NTN = b 2 , 

OMlO + a M 2 T 2 -\ -+ ClMN r N = t>M, (19. lc) 

or in matrix form as 

Ar = b, (19.2) 

where A = {a^} £ W NxM , r = [rq, r 2 ,..., r/v] T , and b = [b\, b 2 ,... , &at] t . The typical situation is 
that A and b are given, and that we wish to solve for r. As an example, consider the equations 

2ri + 4?’ 2 = 4, 

-3?’i + r 2 = 2, 

which are shown in Figure 19.1. The solution is the point where the lines cross at [—2/7,8/7 ] T . 

When M < N there are fewer equations than unknowns, and the system cannot be solved without ex¬ 
tra constraints. In the case that M = N, there exists a solution as long as det A/0 (see Definition 18.23 
for a definition of the determinant). If det A = 0, then the system is degenerated and can be reduced to 
yield a system of equations with fewer than M equations. Numerically speaking, the determinant may 
be near the round-off error, and is thus considered as a degenerated system of equations. In the case that 
M > N, then the system is over constrained and there is typically not an exact solution, in which case 
only an approximation may be found, which minimizes E(r) = || Ar — 6||. 

In the following, we will assume that M = N and that det A/0. The conceptually simplest method 
for solving for r in (19.2) is to invert A and apply it on both sides of the equation 

r = A l b. (19.4) 


(19.3a) 

(19.3b) 


(19.1a) 

(19.1b) 


19.1 Gauss Elimination 

Gauss elimination is a method for inverting the matrix, in which case it is also referred to as Gauss-Jordan 
elimination , and when followed by back substitution may be used to solve a system of linear equations 
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Figure 19.1: Two lines given by the implicit equations in (19.3a)—solid and (19.3b)—dashed. 


directly. It is often used as an introductory method, although it neither is the most memory conserving nor 
the fastest method. 

For Gauss elimination we just consider a system of equations, Ar = b, while for Gauss-Jordan 
elimination, the system (19.2) is augmented with the inverse matrix C = A 1 , such that AC = AA 1 = 
1, where 1 is the identity matrix. We write as follows: 

A [r | C] = [b | 1] (19.5a) 


a\\ 

ai2 ■ 

■ ■ am 


ri 

cn 

C12 • • 

■ ■ Cin 


'bi 

1 

0 ., 

,. o' 

(l‘2\ 

022 • 

■ ■ 027V 


?’2 

C21 

C22 • ■ 

■ ■ C2N 

= 

b 2 

0 

1 .. 

,. 0 

Qni 

am ■ 

■ • o,nn_ 


r N 

CNl 

CN2 ■ ■ 

■ ■ cnn_ 


b N 

0 

0 ., 

,. 1 


Gauss elimination and Gauss-Jordan elimination rely on the same process of reordering information in 
the above equation without destroying its solution, such that the coefficients in A arc replaced with an 
identity matrix, in which case C will be replaced with A -1 and r will contain the solution to the system. 
The system on form (19.5) may be rearranged in three different ways without destroying the solution to 
(19.2): 


Interchange two rows in A: 

Since Ar = b and AC = 1 arc decoupled in (19.5), we realize that interchanging two rows in A 
requires that the corresponding rows in both b and 1 arc interchanged, and amounts to writing the 



0 


B 
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original equations in a different order. As an example, interchanging rows one and two is equivalent 
to writing the system of equations as 


a 2 m + a 22 r 2 4-4 a 2N r N = b 2 , 

aim + a i2 r 2 4-h a XN r N = h, 


(19.6a) 

(19.6b) 


oktvm + aAr 2 r 2 4-4- clnnvn = bjy, 

which clearly does not change its solution. The same argument holds for AC = 1. 


(19.6c) 


Interchange two columns in A: 

Interchanging two columns in A implies that the corresponding rows in r and C must be inter¬ 
changed. As an example, interchanging columns one and two is equivalent to writing the system of 
equations as 


a 12 r 2 4- aim 4-b anvrjv = h, 

a 22 r 2 + a 2 m 4-b a 2N r N = b 2 , 


(19.7a) 

(19.7b) 


aiv 2 r 2 4- ajvm 4-4- (Inn^n = &tv, 


(19.7c) 


which clearly does not change its solution. The same argument holds for AC = 1. However, in 
contrast to the interchanging of rows, the first element of the solution in r is now to be found on 
position two, hence, we need to track the interchanging of columns in order to unscramble the final 
solution to the original ordering. 

Replace a row in A with the linear combination of itself with any other row in A: 

Replacing a row in A with the linear combination of itself with any other row in A will not destroy 
the solution as long as the same linear combination is performed on the same rows in b and 1. When 
using A ri to denote the fth row in A, we may, as an example, replace row two with A 2 * — —Ai*, 
which is equivalent to replacing the second equation in (19.1) with 


(a 2 m + a 22 r 2 4- 


4- (■ a 2N )r N ) - — (aim + a i2 r 2 4- 
an 


+ aiArrjv) = b 1 - —b 2 , 

a ii 


(19.8a) 


4 


a 2i \ ( a 2i 

a 2 i-an n 4- a 22 -a i2 r 2 + 

an J V an 


a 2 i 

a 22 -ai 2 ] r 2 4- 

au 


a 2 i 


an 


a 2 i 


a 2 i. 


4- a 2 iv-aiA? ) vn = b\ - —b 2 


an 


(19.8b) 


4 


an 


a 2 i. 


4- a 2 iv-anv ) tn = b\ - —b 2 


an 


(19.8c) 
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The same argument holds for AC = 1. 


In (19.8), we see that the dependency on the first equation in the second equation may be factored out. 
This is the essence of elimination; that is, if we first eliminate the dependency on the first equation in 
equations 2... N, then we may eliminate the dependency on the resulting second equation in resulting 
3 ... N equations and recurse until we have obtained an equation of the form 


Ur = b (19.9) 
where U = {uij} is an upper triangular matrix, that is, 


lilin + Uur 2 + ■ ■ 

■ • + u 1 N r N 

= b\, 

(19.10a) 

U22V2 + ■ ' 

' • + U2NTN 

= b' 2 , 

(19.10b) 


u NN r N 

= &JV> 

(19.10c) 


for Gauss elimination or with the added and reordered matrices C and 1' for Gauss-Jordan. 

For Gauss-Jordan elimination, the reordered matrix C is now equal to the inverse of A, and we may 
thus use this to compute the solution for r. In the case of Gauss elimination, we may perform back 

p 

substitution, as follows: the last equation in (19.10) is to solve for r n = which in turn may be use 
to solve for r ; .y_ i = - , and recursively for the rest of r. 

Both the Gauss-elimination and the Gauss-Jordan elimination arc numerically unstable, since if the 
leading coefficient of the term to be eliminated is zero, then elimination is impossible; that is, elimination 
will fail in (19.8) if a \ i = 0 or close to the round-off error. The solution is to perform a pivot ; that is, 
interchange rows and/or columns to avoid such a situation. Although not completely safe, it is usually 
enough to interchange rows such that the equation with the largest leading coefficient is to be eliminated 
in the remaining equations. 


19.2 LU Decomposition 

Any matrix A may be factorized into two triangular matrices, 

LU = A (19.11) 
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such that L and U arc lower and upper triangular matrices, 


hi 

0 

0 

... 0 

hi 

h‘i 

0 

... 0 

Ini 

In2 


• • • Inn 

Uu 

Ul 2 

u\:i 

■ ■ ■ UNN 

0 

U22 

U23 

• • • U2N 

0 

0 

0 

■ ■ ■ UNN 


(19.12a) 


(19.12b) 


This is known as LU decomposition. Using LU decomposition, we may solve (19.2) by back substitution 
in two steps. That is, write (19.2) as 


LUr = b, 

(19.13a) 



Lp = b 

(19.13b) 


where 

p = Ur. (19.14) 


That is, using forward substitution, which is equivalent to back substitution, may be used first to solve for 
p in (19.13b) and then back substitution may be used to solve for r in (19.14). 

The triangular matrices L and U contain N 2 + N unknowns, while A only has N 2 coefficients, since 
the diagonal is nonzero in both L and U. As a consequence, we must find further N equations to solve 
for L and U in (19.11). In Croat’s algorithm 

[Press et al., 1999a, p. 44] l n = 1 is used, in which case the coefficient a tJ may be written as 


Q>ij — H* • 


(19.15) 
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algorithm Crout(A) 

N = size(A) 

L = zeros(N,N) 

U = zeros(N,N) 
for i=l : N 

Mi/ i) = 1; 

for j=l:N 
for i=l:j 

U (i,j) = A(i/j) - L(i, 1:i-1)*U(1:i-1,j) 
for i=j+l:N 

L(i,j) = (A(i,j) - L(i,l:j-l)*U(l:j-1,j)) 

end 

end 


Figure 19.2: Pseudocode for Clout’s algorithm for LU decomposition without pivot. 


Examining the first few coefficients, we find that 

an = Znitii = tin, (19.16a) 

ai2 = (nai2 = tin, (19.16b) 

<321 = ^2lHll, (19.16c) 

<322 = Z 21 H 12 + I 22 U 22 = ^2lM2 + 1122, (19.16d) 

<323 = ^2lHl3 + I 22 U 23 = I 21 U 13 + 1123, (19.16e) 

<331 = hiun, (19.16f) 

<332 = I 31 U 12 + I 32 U 22 , (19.16g) 

<333 = I31U13 + I32U23 + I33U33 = I31U13 + I32U23 + 1133, (19.16h) 

<334 = I 31 U 14 + I 32 U 24 + I 33 U 34 = hi^u + I 32 U 24 + H34, (19.16i) 


Thus, we may use a\ + to solve for u \ + in (19.16a), (19.16b), etc., and using this knowledge we may solve 
for /*i in (19.16c), (19.16f), etc. We now only need to determine the coefficients of L and U in the 
(2 ... N) x (2 ... N) submatrices. In (19.16d) we realize that the coefficients / 2 i and it 12 are now known. 
Hence, we may solve for U 22 , and likewise we may solve for U 23 in (19.16e) and the remaining coefficients 
in 112 *• Having uu, Z*i, and 112 * we may solve for Z *2 in (19.16g) etc., and we thus only need to determine 
the coefficients of L and U in the (3 ... N) x (3 ... N) submatrices, and the remaining coefficients follow 
by induction. This is the basis of Crout’s algorithm, which is shown in Figure 19.2. Pivoting is also paid 
of Crout’s algorithm and is discussed, for example, in [Press et al., 1999a, p. 46]; however, we will omit 
this for clarity. 
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In the special case when A is symmetric and positive-definite, (see Section 18.2 for a definition), then 
we may write 

LL T = A (19.17) 

and the LU-decomposition becomes particularly simple, since the last loop in Figure 19.2 is not needed 
because of the symmetry. This is known as Choleski decomposition , and it can be shown that Cholesky 
decomposition is very stable and needs no pivoting. 


19.3 Singular Value Decomposition 

Singular value decomposition ( SVD ) is a method for solving for r in 

Ar = b, (19.18) 

when r G M n , b £ M m , and A £ W nxn ,m > n is singular or close to singular. The fact that m > 
n implies that there are more equations than unknown, in which case the system is called and over¬ 
constrained system. 

Any matrix A £ M mxn , m > n can be written on the following form: 

A = UDV t , (19.19) 

where D £ M nxn is a diagonal matrix, and where U £ M mXT! and V £ M nxn are both orthogonal, i.e., 
their column vectors form an orthonormal basis. For these reasons, the inverse of A is immediately found 
to be 

A' 1 = (19.20) 

where (D 1 ).. = 4-. However, when A is singular or close to singular, then there will be diagonal 
elements da, which arc close to zero, and D 1 will have diagonal values that arc near infinity. Singularity 
of A means that there arc areas in O C M n , where Ar = 0. f £ (>. The set O is called the nullspace. A 
consequence is that r may be added to any given solution r ^ H, while still fulfilling the equation, 

A (r + r) = Ar = b, (19.21) 

The implication is that the solution is undetermined, and we must choose a value. There are two cases to 
consider: either b is reachable for some r, in which case b is said to be in the range of A, or otherwise 
there is no r which solves the system. In both cases, the choice by singular value decomposition is to set 
■j- = 0 in (19.20), when da — 0. When b is in the range of A, this is equivalent to picking the value of r, 
which minimizes ||r|| . When b is not in the range of A, this is equivalent to picking r which minimizes 
|| Ar — 6 || . See [Press et al., 1999a, p. 62] for a simple proof. An algorithm for computing the singular 
value decomposition is given in [Golub et al., 1996, Section 8.6]. 
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19.4 Linear Least Squares 

A method related to singular value decomposition is the method of linear least squares , which as singular 
value decomposition seeks the minimum of 

r* = argmin II Ar — foil 2 . (19.22) 

r 2 

The origin of the linear least square problem is fitting: given a set of data-values {xj,yj},j = 1... m, 
we wish to find the parameters a G M m of a function / such as the coefficients of the polynomial 

m 

f(a,x) = '^2a i x l ~ 1 , (19.23) 

i =1 


which minimizes the quadratic distance 



Vi 


2 


(19.24) 


The coefficients rj. ; arc a measure of error on the j’th data-value, and if nothing is known or presupposed, 
then <7j = I. We will assume in the following that a 3 = I. In general, y j . f , x G M n are vector functions, 
in which case we write 

m 

f( x ) = ^2 a,iXi(x) = X x a, (19.25) 

1=1 

where Xi (x) are any functions but often form an orthonormal basis. Since we give m data values, we can 
at most determine m parameters, hence, a G M m and we may fix X = (x 3 ), X G M nxm , which is 

called design matrix. Thus, our minimization problem is reduced to 


a* = arg min II Xa — y II , 

a 2 


(19.26) 


and we find complete resemblance with the singular value decomposition method. Minimum is obtained 
when 


9 || Xa- y || 
da 


= 0 . 


To obtain the partial derivative we first rewrite the squared Euclidean norm 


(19.27) 


Xa-y\\l = (Xa-y) T (Xa-y) (19.28a) 

= a T X T Xa - a T X T y - y T Xa - y T y (19.28b) 

= a T X T Xa - 2 y T Xa - y T y, (19.28c) 
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using a 7 X 1 y = y 1 Xa, since scalars arc invariant to the transpose operator. Following Chapter 18.3.1, 
we perform the mechanical exercise of calculating the differential in order to evaluate (19.27): 

d || Xa - y\\ 2 2 = d (a T X T Xa - 2 y T Xa - y T y) (19.29a) 

= (da T ) X T Xa + a 7 ( dX T ) Xa + a T X T (dX) a + a T X T X da 
— 2 (( dy T ) Xa + y 7 (dX) a + y T X da) 

~ (( dy T ) y + y T dy) . (19.29b) 


We arc only interested in the linear terms involving da, and using sloppy notation, we write 

3 ||Xa - y\\] _ ( da T ) X T Xa + a T X T X da - 2 y T X da 
da da 

= 2 a T X T X - 2 y T X 1 


(19.30a) 

(19.30b) 


where we again made use of the invariance of scalars to the transpose operator. The coefficients a that 
optimize (19.26) arc found by solving a 7 X 7 X — y 1 X = 0, or equivalently by rearrangement and 
application of the transpose operator 

X T Xa = X T y. (19.31) 

This may be solved using Gauss-elimination. The matrix X 7 X is positive-definite hence invertible, 
and this is often written as a = (X 7 X) X 1 y, where the matrix ( X 7 X ) X 1 is known as the 
pseudoinverse, and the system may be solved using Cholesky decomposition. 


19.5 The Jacobi and Gauss-Seidel Methods 

Iterative methods for solving linear systems of equations arc often rejected in the classical literature be¬ 
cause they converge very slowly toward an accurate solution. In animation, high precision is most often 
not a major concern, and the iterative solutions arc favored, since they very quickly produce solutions that 
arc sufficiently accurate for typical animation problems. 

An iterative method can be stated as 


r k+1 = f(r k ), (19.32) 

where the superscript k denotes the iteration. The iteration is continued until a fixed point is reached, 

r * = f(r*). (19.33) 

Flere, r * is a solution, and the limit of the sequence {r °, r 1 ..... r * }. Converges will be discussed later 
in Section 19.7. Iterative methods arc very effective for solving large sparse matrix systems of the form 
(19.2). The first step to bring such a system onto the form of (19.32) is to decompose A into a sum of 
three matrices: a strictly lower matrix L, a diagonal matrix D, and a strictly upper matrix U, such that 

A = L + D + U. (19.34) 
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This implies that 


(L + D + U) r = b, 

Dr = b - (L + U) r. 


(19.35a) 

(19.35b) 


As long as A and hence D have nonzero diagonal elements, we can invert D and write 

r = D x b- D~ l {L + U)r 1 (19.36a) 

r = c + Tr , (19.36b) 

where c = D 1 b, and T = —D~ 1 (L + U). Thus we have found a linear equation of (19.2) on the form 

(19.33). If the vector r is a solution to the system, then (19.36) is on a fixed point. If r is not a solution, 

we may use (19.36) as an update rule by imposing an iteration number as 

r k+l =c + Tr k , (19.37) 


All iterative methods on this form are called stationary iterative methods, if neither the matrix T nor the 
vector c depend upon the iteration count k. In the following, we will treat stationary methods and defer 
nonstationary methods until Section 19.9. 

Since D is a diagonal matrix, then D 1 is also diagonal, and it is very easy to show that (D 1 ) . = 

—. Hence, the iterative scheme may be calculated as 

a n : 


r 


I, _ V*- 1 /. r k _ V «- 

u t 2 ^i=o %r j 2^1= 


n— 1 


Uj ,-r,- 


H,l 


(19.38) 


This scheme is the Jacobi method and a pseudocode version is shown in Figure 19.3 From the figure it can 
be seen that the algorithm iterates until a convergence test succeeds. Stopping criteria will be discussed in 
detail in Section 19.7. 

Looking closely at (19.38), we see that when updating r k+1 , then we have already computed rj ' 1 1 
for all j < i, and we may use the most recent values in the computation of r k+1 to get a faster solution. 
Our update rule now becomes 


^>#+i —. 


h - 1 r fc+i — V n-J 

u i J~j j= 0 h,j I j Z^/j=i 


fc+r 


1 


j=i +1 U kJ' j 


(Xj ' 


(19.39) 


which is known as Gauss-Seidel method. Pseudocode for the Gauss-Seidel method is listed in Figure 19.4. 
For a closer examination of the Gauss-Seidel method, we rewrite (19.39) by multiplying both sides by a^i 
and collecting all k + l’th terms on the left-hand side. 


i— 1 


71—1 




k +1 


+J2ki r 3 k+1 = b i- E 


UijTj 


3 =0 


j=i +1 


(19.40) 
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algorithm Jacobi(A,x,b) 
r = initial guess 
for k=0,1,2,3,... do 
for i=0 to n-1 do 
delta = 0 
for j = 0 to i-1 do 

delta += A(i,j)*r(j) 
next j 

for j=i+l to n-1 do 
delta += A(i,j)*r(j) 
next j 

rNext(i) = (b(i) - delta)/A(i,i) 
next i 
r = rNext 

check convergence, continue if necessary 
next k 

end algorithm 


Figure 19.3: Pseudocode for the Jacobi method. 


algorithm Gauss-Seidel(A,x,b) 
r = initial guess 
for k=0,1,2,3,... do 
for i=0 to n-1 do 
delta = 0 
for j = 0 to i-1 do 

delta += A(i,j)*r(j) 
next j 

for j=i+l to n-1 do 
delta += A(i,j)*r(j) 
next j 

r(i) = (b(i) - delta)/A(i,i) 
next i 

check convergence, continue if necessary 
next k 

end algorithm 


Figure 19.4: The Gauss-Seidel method. 
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It now follows that 


(D + L)r k+1 = b - Ur k , 

(19.41) 

and by inverting D + L we find that 


r k+1 = (D + L ) _1 (b- Ur k ^j , 

(19.42a) 



r k+l =Tr k + c, 

(19.42b) 


where we have used T = — (D + L)^ 1 U, and c = (D + L)^ 1 b. The Jacobi and Gauss-Seidel update 
equations (19.36) and (19.42) arc on the same form, which implies that the same study of convergence 
will apply for both. We refer the interested reader to [Burden et al., 1997] for details. 

In general, the Gauss-Seidel method is considered superior to the Jacobi method; however, there are 
systems where one converges and the other does not and vice versa. From a computational viewpoint, the 
major difference between the Jacobi method and the Gauss-Seidel method is that in the Jacobi method all 
variables depend only on the solution from the previous iteration and can thus be updated in parallel. In 
contrast, the Gauss-Seidel method partly depends on both the previous and the current solution and is thus 
sequential in nature. Furthermore, the Gauss-Seidel method depends on the order in which the variables 
arc updated. This implies that if we compute the solution, r k , and then repeat the computation with a 
reordering, 7 r(-), of the variables to obtain the solution, n(r) k , then it is very likely that we find 

r k ^n{r) k . (19.43) 

Thus, reordering can affect the rate of convergence of the Gauss-Seidel method, which may be exploited 
to increase convergence rate. 

19.6 Successive Over Relaxation 

Suppose that r k is an approximation to the linear system Ar = b , then we may calculate the approxima¬ 
tion residual as 

p k = Ar k — b. (19.44) 

Let {p k } denote the sequence of residual vectors for the Gauss-Seidel method, corresponding to the 
sequence of solutions { r k }. The subscript might appeal - confusing; it refers to the currently known 
solution in the fe’th iteration just before updating the Fth variable. Inserting the Gauss-Seidel update rule 
(19.39) into (19.44) we find the m’th coordinate of p k+1 to be 

i— 1 n— 1 

Prrd — ^rri ^ ^ j ^ ^ j rn (19.45) 

j =0 j=i +1 

i— 1 n —1 

= b m ~ 22 a m ,jr k+1 - 22 a m,jr k (19.46) 

3=0 j=i 
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If we look at the f th coordinate, we have 


p!t l = b > X ' ; - X a ^ r f ~ ai ’ ir i 


ijT? + Pu +1 = b i - X - X 


i— 1 n— 1 

fc+1 \ ' „ „ fc „ „ fc 

3 

3=0 j=i +1 

i—1 7i—l 

fc+1 \ ' „ „ fc 

3 

3=0 j=i+ 1 


(19.47) 


a i)j r j 


Looking at the left-hand side and comparing with (19.39), we see that it is equal to a* j jr i fe+1 , so we have 

+ p. t - +1 = (19.48) 

Or if we rearrange terms we get 


h-\-\ 

r k+1 = T-/ 5 + (19.49) 

0*1,1 

Until now we only rewrite the update step in terms of the residual vector. We can gain more insight into 
the Gauss-Seidel method by repeating the above steps for the i + l’th variable update, which is 

i n— 1 


pi%\ = b > - X a M r /' +1 - X vj 


7=0 
i— 1 


7=*+i 

n—1 


6; X"' + r / ''' X 


. r fc _ 0 . fc+! 


7=o 


7=7+1 


. fc +1 


= 0 


fc+i 


The last step follows from (19.39). Thus, the Gauss-Seidel method is characterized by choosing r i 
in such a way that the r th component of 1 is zero. Reducing one coordinate of the residual vector 
to zero is not generally the most efficient way to reduce the overall size of the vector p / 1 . We need to 

small. Modifying (19.49) to 


choose r, : k+1 to make 


pX 1 


, fc+i 


. fc+i 


= rt+u- 


(19.50) 


where +■ > 0. For certain values of ui, the norm of the residual vector is reduced and this leads to faster 
convergence. The lo parameter is a relaxation of the Gauss-Seidel method, and the new iterative scheme 
is called successive over relaxation ( SOR ). When 0 < u < 1 the method is termed under-relaxation, 
and when 1 < lo the method is call over relaxation. Notice that when uj = 1 the method is simply the 
Gauss-Seidel method. 



“book” — 2005/9/30 — 15:44 — page 618 — #630 


618 


CHAPTER 19. SOLVING LINEAR SYSTEMS OF EQUATIONS 


algorithm S0R(A,x,b) 
r = initial guess 
for k=0,1,2,3,... do 
for i=0 to n-1 do 
delta = 0 
for j—0 to i-1 do 

delta += A(i,j)*r(j) 
next j 

for j=i+l to n-1 do 
delta += A(i,j)*r(j) 
next j 

delta = (b(i) - delta)/A(i,i) 
r(i) = r(i) + w(delta - r(i)) 
next i 

check convergence, continue if necessary 
next k 

end algorithm 


Figure 19.5: The SOR method. 


Insert (19.47) into (19.50) to reformulate the SOR method into a more implementation friendly equa¬ 
tion. 


r , k+1 = v, k +u 


bi - eU o«*7 +1 - Epii ««>•/ - . 


a,- 


= (1 - u) r k + — ( b t - y a^r 


i— 1 n— 1 

fc+t _ n r k 

3' j Z_^ u hJ'j I ! 

3=0 j=i +1 


(19.51) 

(19.52) 


whose pseudocode is shown in Figure 19.5. To determine the matrix form of the SOR method we rewrite 
(19.52) as 


2—1 


n— 1 


ai,iT k+1 + to ^2 ai ,i r J ' = (! - u) di^rf + uA :-u ^ 

3=0 j=i +1 

from which we see that 


{D + c oL) r fc+1 = ((1 - lo) D - uU) r k + tub 


or 


(19.53) 

(19.54) 

(19.55) 


r k+1 = (D + luL)- 1 {(1 - lu) D - loU) r k + lu (D + ujL)' 1 b 

If we let T = (D + cuL) -1 ((1 — t o) D — ujU), and c = lu (D + ivL)~ 1 b, then we see that the SOR 
method can be written on the form 

r k+1 =Tr k + c. 


(19.56) 
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Jacobi Method r k+1 = D 1 [b — {L + U) r 

Gauss-Seidel r k+1 = D 1 (6 — Lr fc+1 — Ur fc ) 

Successive over relaxation r k+1 = r k + oj {D 1 (b — Lr k+l — Ur k ) — r k ) 

Table 19.1: Matrix forms of the three iterative Matrix solvers for the system (L + D + U)r = b. 


In short, the SOR method has the same form as the Jacobi and the Gauss-Seidel methods. 

We end this section by listing the matrix forms of the three iterative matrix solver methods in Ta¬ 
ble 19.1. 


19.7 Stopping Criteria for Iterative Solvers 

A major point of any practical implementation of any iterative solver is to determine when to stop the iter¬ 
ations. A combination of stopping criteria is often used, such as a maximum on the number of iterations, 
and convergence tests. 

It is often a good idea to specify a maximum limit on the number of iterations that one is willing to 
use in an iterative method. There are mainly two good reasons for this. In a time-critical computation one 
wants to control how much time is spent on a computation, a maximum limit can help enforce this. In 
case the problem is unsolvable (i.e., diverges), an iterative method would loop forever, and the loop must 
therefore be stopped by a maximum iteration limit. 

This is typically done by some form of stopping test based on the convergence of the method. A 
convergence test is a test for how close the vector r r+1 is to a fixed point (see (19.33)). 

Definition 19.1 (Convergence) 

An infinite sequence {r fc } fc=0 = {r 0 ^ 1 ^ 2 ,...} of vectors in M n is said to converge to r * w.r.t. the 
vector norm ||-|| if, given any e > 0, there exist an integer N(e) such that 


r 


k 


— r 


<e for all k>N{e). 


(19.57) 


As is seen from Definition 19.1, the concept of a vector norm is vital to defining convergence. In theory, 
any vector norm can be used, but we will only consider the 1 2 and the norms, see Definition 18.11 and 
Definition 18.12. Often the convergence test will consist of several stopping criteria used in combination. 

An absolute estimate for the error can be taken by measuring the difference between the solutions of 
two succeeding iterations, r k+1 and r k . The simplest stopping criteria is to allow for a user-specified 
threshold, e, and to consider the sequence to have converged, when 


fc+i 



< £. 


(19.58) 


By comparison with Definition 19.1 it is evident that the test will succeed in case the iterative method 
converges. Nevertheless, if convergence is very slow, then this method will fail. Even worse, there exists 
divergent sequences where r k+l — r k converges to zero, as the following example illustrates: 
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Example 19.1 

Let the sequence {x fe }, —1 of real numbers be defined by 


x = 


k 

Y.7. 


(19.59) 


r =1 


Now the limit of the difference is 

lim (x k — x k 
k—> oo V 



= 0 


However, 


lim x k = lim 

k —>oo k —»oo 


f k > 

E; 

vr=l / 


= OO 


(19.60) 


(19.61) 


meaning that the sequence is divergent. 

As an alternative to the absolute error measure, a measure of relative error is sometimes applied. 


r fc+l _ r k 
N'T fc + l II 


< £. 


(19.62) 


Observe that the /:: + l’th solution must be nonzero in order for the relative test to work. Often the 
norm is used, since it is cheap to compute and is much more robust toward numerical deficiencies, such 
as overflow and underflow, in contrast to the 1 2 norm. 

Using Landau symbols, see (20.11), we may define the rate of convergence as when 


x k = a + 0(p k ) 


(19.63) 


then we say that x k converges to a with rate of convergence 0(3 k ). In many cases, the sequence {/3 k } 
has the form 


f5 k = 


1 

kp 


(19.64) 


for some number p > 0. We consider the largest value of p so that x k = a + ()(1 /k p ) to describe the rate 
at which x k converges to a. 

We have given this formal presentation of rate of convergence because iterative methods might reorder 
the variables such that variables with the slowest rate of convergence arc updated before variables with 
faster rate of convergence. In practice, the relative convergence test is not used. Instead, the magnitudes 
of absolute error of the individual variables arc used as an ad hoc measure of rate of convergence, that is, 
if 

(19.65) 


\ r fc+1 — r k \ <r It* k+1 — r k \ 
I' i ' i I ^ I' j j I 


The i’th variable is considered to converge faster than the j’th variable. This ad hoc way of measuring rate 
of convergence allows a simple approach in practice, where variables arc reordered in decreasing order 
w.r.t. the absolute error. 
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19.8 The Gradient Descent Method 


The gradient descent method [Press et al., 1999a, Golub et al., 1996, Shewchuk, 1994] is a nonstationary 
iterative process, meaning that its parameters, T and c, change with time, hence, our model problem has 
the following form: 

r k+1 = c k + T k r k . (19.66) 

The gradient descent method has its root in minimization. Assuming that we wish to minimize the 
quadratic equation, 

E(r) =-r T Ar — r T b + c, (19.67) 

then a common approach is to follow the gradient of E, 

VE(r) = -Ar + - A 1 r — b 
= i.(A + A T )r-b. 

in the negative direction. That is, using a forward Euler scheme we write 

r k+1 = r k -aVE{r k ), (19.69) 


(19.68a) 

(19.68b) 


where a some positive constant. This is known as the gradient descent method, and with sufficiently small 
o’s will minimize (19.67). When the gradient is zero, then the Euler scheme will be at a stationary point, 
and (19.67) will be at an extremal point. It is readily seen that ^ (A + A 1 ) r = b is a stationary point, 
and in case A is symmetric then (19.68) will reduce to 

VE(r) = Ar — b, (19.70) 


which will be stationary, when Ar = b. Hence, for positive-definite symmetric matrices, minimizing 
(19.67) is equivalent to solving (19.2). Geometrically positive-definite matrices have a structure similar - to 
x 2 + y 2 , and the gradient field is thus radiating from the global minimum as illustrated in Figure 19.6(a) 
and 19.6(d). In Figure 19.6 is the graph and gradient field plotted for 


f(r) = r 1 Ar, 


using the following three matrices, 

Positive-definite A 
Negative-definite A 
Indefinite A 


1 0 
0 lj ’ 

-1 0 ' 

0 -lj ’ 

1 0 ' 

0 -1 ' 


(19.71) 


(19.72a) 

(19.72b) 

(19.72c) 
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(d) (e) (f) 

Figure 19.6: Visualizing positive-definite, negative-definite, and indefinite matrices and their correspond¬ 
ing gradient fields. 


It is visually clear that for the negative-definite matrix then the gradient descent algorithm will converge 
toward a point at infinity. Of course, the maximum could be found by simply modifying the algorithm 
to perform a gradient ascent instead. Worse arc the indefinite matrices, which should not be expected to 
converge to the saddle point. 

In the remainder of this chapter we will assume that A is symmetric and positive-definite. Rather 
than choosing small values for a, we could instead search for the value of a minimizing (19.67), which is 
equivalent to searching for the minimum along the line starting at r k and with direction VE(r k ). This 
is known as a line search. To discuss the line searching procedure, we’ll first introduce the notion of error 
and residual. Assuming that the optimum is located at r *, and that the current position is r k , then the 
error in position is given as 


(19.73) 
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while the residual or error in b is 


p k = b — Ar k . 

(19.74) 

Using (19.73) and (19.70) we find that 


p k =-Ae k 

(19.75a) 

= —VE(r k .) 

(19.75b) 

In the line search algorithm, we search for the value of a in (19.69) that minimizes 
write the quadratic energy parameterized by a, differentiate and find the zero point 

(19.67); hence, we 

0 = |% w (a)) 
da 

(19.76a) 

= VE{r k+l (a)) T ^-r k+1 (a ) 
da 

(19.76b) 

= VE{r k+1 (a)) T VE(r k ). 

(19.76c) 

Thus, we should choose a such that the new search direction is orthogonal to the previous. Using (19.75), 

(19.74), and (19.69) we may determine a as follows: 


0 = VE(r k+1 (a)) T VE(r k ) 

(19.77a) 

= p k+lT VE{r k ) 

(19.77b) 

= (b- Ar k+1 Y VE(r k ) 

(19.77c) 

= (b-A (r k - aVE(r fc )) ) T VE{r k ) 

(19.77d) 

= (V + aAVE(r k )Y VE(r k ) 

(19.77e) 

= (~\7E{r k ) + aAX7E(r k )y \7E(r k ), 

(19.77f) 

which implies that 


VE{r k ) T VE(r k ) 

a = - - 

VE(r k ) T A T VE{r k ) 

(19.78a) 

(p k ) T Vp k 

(p k ) T A T p k ) ’ 

(19.78b) 


and where the transpose of A can be ignored since it is a symmetric matrix. The complete algorithm is 
shown in pseudocode in Figure 19.10. 
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Algorithm GradientDescent(A,b,r) 
do 

residual = b-A*r 
alpha = (r'*r)/(r'*A*r) 
r = r+alpha*residual 
until residual sufficiently small 
end algorithm 


Figure 19.7: Pseudocode for the gradient descent algorithm on (19.67) for a symmetric, positive-definite 
matrix A. 



Figure 19.8: Isophotes of an elongated energy function. At each step, the search direction is parallel to 
the gradient of the energy function. 


19.9 The Conjugate Gradient Method 

Gradient descent often is the method of choice where more complicated methods must give up. Neverthe¬ 
less, it is rather slow, since it will tend to take repeated steps in similar directions. This is especially the 
case when the energy function is deep and elongated, as shown in Figure 19.8. It would seem better to 
avoid searching in directions previously investigated, and hence make a sequence of orthogonal directions, 
{d °, d 1 ,..., d N }, such that 

r k+1 = r k + a k d k . (19.79) 

This is the conjugate gradient descent method. The consequence would be that once a direction had been 
investigated, then it would not be used again. Then the stationary point would be found in N iterations, 
where N is the dimensionality of r. As a consequence, the error at k + 1 must be perpendicular to the 
direction at k, and thus 

(d k ) T e k+l = 0, (19.80) 

as illustrated in Figure 19.9. Thus, inserting (19.73) and (19.79) we find that 
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Figure 19.9: Optimal distance along d k is such that a k d k is perpendicular to e fc+1 . 


(, d k ) T e k+1 = ( d k ) T (r k+1 -r*j 

(19.81a) 

= ( d k ) T [r k + a k d k — r*^j 

(19.81b) 

= ( d k ) T ( e k + a k d k ) , 

(19.81c) 

and conclude that 

ak _-{d k ) T e k 
( d k ) T d k 

(19.82) 

Unfortunately, this is not possible without the knowledge of e k , and if we knew e k . then we would also 
know the solution, r *. 

Fortunately, performing the line search method along d k in order to find the optimal a k for a fixed k 
is solvable. Hence, we start by taking the derivative of (19.67) parameterized by a k using (19.79): 

0 = ^-E(r k+ \a k )) 
oa k 

(19.83a) 

= VE(r k+ \a k )) T ^-r k+1 (a k ) 
oa k 

(19.83b) 

= VE(r k+1 (a k )) T d k . 

(19.83c) 

Using (19.75) and setting the gradient equal to zero we find 


0 = VE(r k+l {a k )) T d k 

(19.84a) 

= ( Ae k+1 Y d k 

(19.84b) 

= ( d k ) T Ae k+l . 

(19.84c) 


This is essentially the same equation as (19.80) except it is now with the metric A instead of the usual 
standard Euclidean metric. This is a generalization of orthogonality, and when two vectors fulfill a T Ab = 
0, then they arc said to be conjugate directions or A-orthogonal. 
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Continuing from (19.84), we rewrite the equation for conjugate directions by inserting (19.73) and 
(19.79), and we find that 


(d k ) T Ae k+1 = (d k ) T A{r k+l - r *) 

= (d k ) T A(r k + a k d k -r*) 

= (d k ) T A{e k + a k d k ). 

Finally, setting the above equal to zero, isolating a k , and using (19.75) gives 

— (d k ) T Ae k 
ak ~ (d k ) T Ad k 

~{d k ) T P k 

(d k ) T Ad k ' 


(19.85a) 

(19.85b) 

(19.85c) 


(19.86a) 

(19.86b) 


Notice that if we choose d k = p k . then the method simplifies to gradient descent (19.78). 

To find N conjugate directions, d k , we will use the conjugate Gram-Schmidtprocess : assume that we 
have a set of linear independent vectors {d 0 . d 1 ,..., d v ~ 1 }, for example, the coordinate axes, then we 
will iteratively produce vectors d k that arc A-orthogonal to the previous {d °, d 1 ,..., d fc_1 } vectors as 
follows. Start by setting, 

d° = 8°, (19.87) 

and then for i = 1..N — 1 iteratively remove dependence on 8 k from the previous selected directions d 1 
by 

k -1 

d k = S k -Y^Pkid 1 . (19.88) 

i =0 

The constants (3^ arc calculated using A-orthogonality, since for any j < k we enforce 

{d k ) T Ad j = 0, (19.89) 


and when expanding with (19.88) it is found that 

0 = {d k ) T Ad j 

/ k -1 

= U fc -£>id 

V i =0 


‘) T a* 


= ( S k ) T Ad j 

= ( 8 k ) T Ad j 


k -1 


'£,Pki{d i ) T Ad j 

i =0 

d kl (d') r Ad-'. 


Hence, isolating gives. 


(8 k ) T Adi 
~ (di) T Adi' 


(19.90a) 

(19.90b) 

(19.90c) 

(19.90d) 

(19.91) 
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Using the coordinate axes as the initial set of linear independent vectors {b 0 , b 1 ..... b v ~ 1 } is not 
optimal. Algorithmically, it’s much faster to use the residual at iteration k as b k . To see this, let’s first 
examine the error. The set {d k \ gives an orthogonal basis that leads from the initial point r 0 to the 
extremum point r *; hence, we may write the initial error e 0 as a linear combination of {d k }, i.e. 


JV-l 


e° = r°-r* = ^ 7 i d i 

(19.92) 

i= 0 


Using A-orthogonality, we may determine the k’th constant 7 fc as 


N- 1 


(d k ) T Ae 0 = ^2'y i (d k ) T Ad i 

(19.93a) 

i= 0 


= 7 fc (d fc ) T Ad fc 

(19.93b) 

(d k ) T Ae 0 
lk ~ {d k ) T Ad k 

(19.93c) 

Furthermore, using (19.79) we can write 


fc-i 


r k = r 0 + ^2 OLjd \ 

(19.94) 

3= 0 


and subtracting r * from both sides we get 


fc-i 


r k _ r * = r 0 — r * + ^2 a.jd, 

(19.95a) 

3=0 




fc-1 


•o> 

e 

w 

+ 

o 

VD 

II 

vu 

(19.95b) 

3=0 


This allows us to isolate e 0 and insert it into (19.93) to get. 


(dYA(e*-ZU a i di ) 

lk ~ {d k ) T Ad k 

(19.96a) 

VD ^ 

II 

(19.96b) 

(d k ) T p k 
(d k ) T Ad k 

(19.96c) 

= -a k . 

(19.96d) 
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The consequence is that for every iteration, we remove a term in the error; that is, inserting (19.92) into 
(19.95b) and using (19.96) we get 


N-l 

k -1 


Ey‘ d ’ 

+J 2 a 3 d> 

(19.97a) 

i=0 

3=0 


N-l 

k -1 


EVi* 


(19.97b) 

i=0 

N-l 

3=0 


E i‘ d ‘ 

i=k 


(19.97c) 


For the final iteration, the method converges, since we have k = N — 1 for which case the error is zero. 

It is smart to use the residuals first because the residual is orthogonal to the previous search directions. 
Using (19.83) and (19.75) we see that the residual is orthogonal to the previous search direction. 


0 = -VE{r k {a k )) T d = (p k ) T d k ~ l . (19.98) 

Furthermore, the residual is orthogonal to all the previous search directions, that is, 

{p k ) T d j = Ofor j < k. (19.99) 

To prove this, consider the sequential buildup of the error using (19.75), (19.79), (19.73), and (19.86), 

(p k ) T d j = -{e k ) T A T d j (19.100a) 

fc-i 

= -(e° - ^a i d i ) T ^ T d j (19.100b) 

i= 0 

k—1 

= -(e°) T A T d j + ^a i (d i ) T A T d j (19.100c) 

i= 0 

= — (e °) T A 1 d^ + oP {d ■ J ) T A 1 d J (19.100d) 

= — (e °) T A T d J — (d J ) T p J (19.100e) 

= -( e°) T A T d j + ( d j ) T Ae j (19.1000 

3 - 1 

= -(e°) T A T d J ' + (d j ) T A(e°-^2a i d i ) (19.100g) 

i= 0 

= -(e °) T A T d j + (d j ) T Ae 0 (19.100h) 

= 0 (19. lOOi) 
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The final step to reach the algorithm is to consider a simplification for calculating f5y.j in (19.91). Since 


p k+1 = —Ae k+1 

(19.101a) 

= -A 

(r fc+1 - r 

(19.101b) 

= -A 

(r k + a k d k -r*^j 

(19.101c) 

= -A 

. (e k + a k d 

(19.lOld) 

= P k 

— a k Ad k . 

(19. lOle) 

Multiplying both sides with (p i) T with get the numerator from (19.91) 


(, P j f P k+1 = (p j 

) T p k - a k (p j ) T Ad k 

(19.102a) 




( p ifAd k = ±< 

a K 

(( pi) T p k -(pi) T p k+1 ). 

(19.102b) 

However, since all residuals are orthogonal we get 


(, p j ) T Ad k = < 

j = k , 

) = k + i, 

(19.103a) 


k 0 otherwise, 



and since /i/,. ? is only calculated for j < k we get 


ftkj 


( P j ) T P j 

ai- 1 (di- 1 ) T Adi - 1 


j = k + 1, 

otherwise, 


(19.104) 


Finally, the complete algorithm is shown in Figure 19.10. The conjugate gradient method converges 
in theory to the exact solution in N steps, where N is the dimensionality of r. However, round-off errors 
will cause the solution to deviate from A orthogonality, and hence the convergence will not be perfect 
after N steps. Furthermore, for large problems it may be desirable to terminate the conjugate gradient 
iteration before convergence to reduce the computation time for inexact problems. 


19.10 The Linear Complementarity Problem 

The Linear Complementarity Problem ( LCP ) is a well studied mathematical problem (see [Murty, 1988, 
Cottle et al., 1992]). The linear complementarity problem is used to solve for a discontinuous relation¬ 
ship between two variables such as contact conditions between rigid bodies (see Chapter 7). Linear - and 
quadratic programming is often mentioned in connection with the linear - complementarity problem. The 
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Algorithm ConjugateGradient(A,b,r) 
direction = b - A*r 
residual = direction 
for i = 1:length(b) 

alpha = (r'*r)/(direction'*A*direction) 

New_r = r + alpha*direction 

New_residual = residual - alpha*A*direction 

beta = (New_residual'*New_residual)/(residual'*residual) 

direction = New_residual + beta*direction 

r = New_r 

residual = New_residual 
end 

end algorithm 


Figure 19.10: Pseudocode for the conjugate gradient algorithm on (19.67) for a symmetric, positive- 
definite matrix A. 

main reason is that a linear programming ( LP ) problem can be rephrased as a linear complementarity 
problem, and a quadratic programming ( QP ) problem can be reformulated as a linear complementarity 
problem. The linear complementarity problem is a general problem, which unifies linear - and quadratic 
programs. Finally, linear - programming and quadratic programming are special cases of mathematical 
programming. 

Several methods exist for solving linear - complementarity problems, there are four main categories of 
methods: Newton methods, interior point methods, direct methods (also known as pivoting methods), and 
iterative methods. We refer the interested reader to [Murty, 1988, Cottle et ah, 1992] for details. 

Implementing a LCP solver can be cumbersome and tricky, fortunately there already exist some 
solvers. The following is a brief overview of a few existing solvers. For robust simulations PATH from 
CPNET by Michael Ferris et al. [Path, 2005] is highly reconmrendable. It is, as far - as we know, based on a 
Newton method. The Lemke algorithm (a pivoting algorithm) is often considered to be a good choice, due 
to its wide applicability and robustness. The source code accompanying the game physics book by Eberly 
[Eberly, 2003a] contains an implementation of Lemke’s algorithm. Open Dynamics Engine by Russel 
Smith [ODE, 2005] contains aDantzig Pivoting algorithm, an iterative Successive-Over-Relaxation (SOR) 
LCP Solver, and a Conjugate Gradient solver. Also OpenTissue [OpenTissue, 2005] contains implemen¬ 
tations of the Dantzig Pivoting algorithm and an iterative SOR/Gauss-Seidel LCP solvers. 

The current trend in the Game industry [NovodeX, 2005, ODE, 2005, IO Interactive A/S, 2005] is 
to use iterative methods for solving LCPs, since these provide a tradeoff between accuracy and speed. 
Furthermore, iterative methods are well-suited for sparse and extremely large problems. The trade-off 
property can be exploited in time-critical environments to obtain a solution as accurate as the time permits. 
Notice that unlike for instance the pivoting algorithms, the iterative methods provides an approximate 
solution within a given threshold. 

In the following we will make use of the inequality relation, >, for vectors. This is a shorthand 
to denote that the inequality holds for each pair of components, that is u = (u\, U 2 , ■ ■ ■, un) T > 
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(v\,V 2 , ■ ■ ■ ,vj\ j) t = v, if and only if u, > v t for 1 < % < N. Now, let us briefly restate the linear 
complementarity problem we want to solve. 

Definition 19.2 (The Linear Complementarity Problem) 

Given a symmetric matrix A £ M nxn , a vector b £ M n , a vector of lower limits, x\ 0 < 0, and upper limits 
x M > 0, where xi 0 , x^ £ W 1 . Find x £ R n and w £ R n such that 


w = Ax — b, (19.105a) 

xio<x< x hi , (19.105b) 

and for all i = 0,..., n — 1, one of the three conditions below holds 

Xi = X [ 0i , Wi > 0, (19.106a) 

Xi = x hii , Wi < 0, (19.106b) 

X[ 0i <Xi < x Mi , w i = (19.106c) 


Notice that as long as x, is within its lower and upper limits, w, is forced to zero. Only at the lower and 

upper limits is non-zero. Usually, the general LCP is formulated with x\ a = 0 and x\, t = oo, in which 

case the above definition reduces to the single set of conditions 

w = Ax — b > 0, (19.107a) 

x > 0, (19.107b) 

x T w = 0. (19.107c) 

In multibody dynamics problems the A-matrix is often symmetric, unless a linearized friction cone is 
used, in which case auxiliary variables arc used to couple the normal force to the tangential force at a 
contact point. The A-matrix is often positive semi-definite (PSD) or sometimes positive definite (PD). 
Even if it is PSD, tricks such as damping (adding positive values to the diagonal) can be applied to make 
it PD. To make a long story short, the A-matrix can be made numerically more pleasant, however, the 
solution will be damped and therefore yield weaker constraint forces. 

In this section we will limit ourselves to present iterative methods based on matrix splitting methods. 
These iterative methods arc called projection methods. The main idea is to combine a matrix splitting 
method (described in Section 19.5 and Section 19.6) with a projection/clamping operation. 

Splitting methods can be summarized as follows: we want to solve Ax = b, and to do so we introduce 
a splitting of A as A = M — N, hence 

(M — N) x = b, (19.108) 

which gives the iterative scheme, 

Mx k+1 = b + Nx k , (19.109) 

x k+1 = M~ l {b +Nx k ). (19.110) 


The Jacobi method uses M = D and N = — (L + U ), whereas Gauss-Seidel take M = (D + L) and 
N = —U. Here AT' 1 can be understood as a preconditioner to the iterative scheme in (19.110). 

The projection operation can be summarized as: 
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Definition 19.3 (Projection) 

Given a vector x E R n , a vector of lower limits, xj a < 0, and upper limits x /„• > 0, where x/ 0 , Xhi G R n , 
the projection operation on x is written ( x) + , and means that for each j = 0 to n — 1 

*+ = max (min (xj , x Mj ) , x hj ) . (19.111) 

As it can be seen from Definition 19.3 the projection operation works element-wise. If a coordinate 
exceeds a lower or upper limit then it is projected back onto the violated limit. We can now present the 
iterative LCP solver. 

Corollary 19.1 (The iterative LCP solver) 

Given a splitting method, an iterative solver to the LCP problem in Definition 19.2, is given by 


x 


k+l 


= (M~ L (b + Nx 


(19.112) 


According to [Murty, 1988] the only requirement is that A is symmetric. In which case it can be proven 
that if the sequence of solutions converges to a limit point, then that point will be a solution of the LCP 
(see Theorem 9.9 pp. 369 in [Murty, 1988]). Furthermore, it can be shown that if A is also PD then the 
sequence will converge to a solution of the LCP (see Corollary 9.3, pp. 372 in [Murty, 1988]). This is 
often enough for our purpose. We refer the interested reader to [Murty, 1988] for detailed proofs regarding 
the general iterative scheme. 

Convergence rate, p, can be defined as: 

11 e k 11 

oo iigfc—l ip = A ’ (19.113) 


where e k is the error in the k’th iteration and A is called the convergence constant. Linear Convergence 
means p = 1, so 


= A 


1 


(19.114) 


Whereas quadratic means P = 2, and 


= A 


„fc-i 


(19.115) 


The main problem with the iterative method we have presented is that it only has linear convergence. In 
practice this means that after a certain number of iterations, the improvement per iteration is decreasing. 
In fact one will always get closer to the solution, but never really hit it. This is unlike direct methods, 
which is capable of providing the exact solution to the problem. Besides, the speed of convergence of the 
iterative methods is dependent on the order of the variables. 

The theoretical convergence rate of Newton methods is quadratic, even though [Lacoursiere, 2003] 
only reported linear convergence for multibody dynamics. Interior point methods also have quadratic 
convergence. The amount of work per iteration in Newton methods is no worse than 0(n :i ). For direct 
methods the amount of work per iteration is no worse than 0(n 2 ). Lemke’s method needs n iterations to 
converge which yields a total complexity of 0(n 3 ). 
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In contrast, the amount of work needed in the iterative method per iteration is only 0(n), if an up¬ 
per bound on the number of iterations can be set. This effectively mean, that the solver is linear in 
time. Of course the solutions may be horrible, and one need other techniques to remedy these artifacts 
[Erleben, 2005]. 

In comparison, Newton methods, direct methods, and interior point methods arc more costly per itera¬ 
tion than the iterative methods. On the other hand they often have better convergence properties, implying 
they only need very few iterations to reach an acceptable solution. Nevertheless, iterative methods arc 
very simple to implement, and in combination with shock-propagation they can deliver acceptable solu¬ 
tions within 5-10 iterations [Erleben, 2005]. 
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20 

Taylor Expansion and Derivative Approxi¬ 
mations 

Many measurements and representations consist of a discrete set of values sampled on a manifold. For 
example, we may consider an image to be a discrete set of values sampled on a regular grid on a plane, and 
the surface of a shape may be represented as a discrete set of triangles. To perform differential geometry, 
we need to assume that the discrete set of values is sampled from an underlying analytical function. 

20.1 Taylor Series 

In this section we will briefly review Taylor series, since they yield great insight into the approximation of 
smooth functions. This chapter is inspired by the presentation in [Bronshtein et al., 1997]. 

Assume that we arc given a ID “well-behaved” function, / : K -> M, for which all derivatives exist 
everywhere. That is, the n’th order derivative f (n> is defined in all points. The value of the function / in 
two points x and x + h arc related through the Taylor series : 

f(x + h) = f[x) + hf'(x) + i h 2 f"(x ) + ^h 3 f'"(x) + ... 

°° M 

= Y j ^f^{x). ( 20 . 1 ) 

i= 0 J ' 

This is also known as the power series; the special case where x = 0 is also called the Maclaurin series. 

As an example, consider the exponential function f(x) = exp(x). It is arbitrarily differentiable as 
/= /, and exp(0) = 1. Hence, 

OO L.Z 

exp (h) = f(0 + h) = y^-r, (20.2) 

i=o l - 

and 

OO 

exp(l) = ^-. (20.3) 

*=o 

Before we give Taylor’s theorem, we will first examine the mean value theorem. 

Theorem 20.1 (Mean Value Theorem ) 

For a function f : M —> M, which is continuous in [a, b] and differentiable in (a, b), there exists an 
xo G (a, b), such that 

f(xo) = MzM. (20.4) 

b — a 
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We can rewrite (20.4) as 

f(P ) = /(a) + {b~a)f'{x 0 ), 

(20.5) 

and using h = b — a, and 9 = 

\ a ,we find 



/(a + h) = f (a) + hf\a + Oh). 

(20.6) 


This form of the equation has the spirit of the Taylor theorem. 

Theorem 20.2 (Taylor’s Theorem ) 

For a function f : M —> K, which is (n + 1) times differential on some open internal x G (a, b), then 


n ui 

f(x + h) = ^2 — f®(x) + Rn(x, h), (20.7) 

i =0 

where the remainder on Lagrange’s form is given by 

un +1 

R n (x, h ) = -——f( n + 1 \ x + 0h), (20.8) 

(n + 1 )! 

and where 9 £ (0,1) is an unknown constant given by Theorem 20.1. 

A function / is said to be an analytical function at the point x, if for all |/;| < r > 0 

oo h ^ 

f(x + h) = ^2 ~ / W W' (20.9) 

*=o *' 

The largest h, for which the sum converges, is called the radius of convergence. 

An often-used notation in Taylor expansion is the Landau symbol 
or “Big-O,” which disregards the exact form of the remainder by 

Rn(x,h) = 0(h n+1 ), (20.10) 

meaning that given a function g(x ) = 0(f(x)), then there exists a constant M £ R independent of x, 
such that 

\g(x)\<M\f(x)\ (20.11) 

for all x possibly limited to some finite interval. Note that the Landau expression g(x) = 0(f(x)) is 
a statement about a set of functions, and we can be tricked into erroneous conclusions. As an example, 
assume that g(x) = 0(f(x)). Since the Landau symbol only defines relations up to some unknown 
constant M, we can conclude that 3 g(x) = 0(f(x)). This is valid since the two uses of the Landau 
symbol imply two different constants; but beware it does not imply that g(x) = 3 g(x). Nevertheless, it is 
correct to infer that 


0(x m ) + 0(x n ) = 0(x n ) 


(20.12) 
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when n > m, and it may also be inferred that 

xO(x n ) = Q(x n+1 ). 


(20.13) 


Using Landau symbols, the Taylor series may be written as 

n W 

f{x + h) = Y J -/ (<) (*) + 0(h n+1 ). (20.14) 

i=o L ' 

The extension to higher dimensional functions is straightforward, since differentiation is separable. 
For example, assume we have a 2D function / : M 2 —> M, then the Taylor expansion is given as 


/(x + A. , + k) = ± fl- t „) + 

i =0 j =0 ' y 


Reordering terms we get 


f {x + h ,y + k) = J {x ,y ) + h a J^ +k d ’^ 


dx 


dy 


, h 2 d 2 f(x,y) ( k 2 d 2 f(x,y ) | U 1 d 2 f(x,y) 

' ^ r\ O I r\ O ’T" 


2 Ox 2 2 0y 2 


dxdy 


(20.15a) 

(20.15b) 


(20.16) 


20.2 Finite Differences by Forward, Backward, and Central Approxima¬ 
tions 

In the following, we will discuss approximations of derivatives of a function using its neighboring values. 
The theory to be presented in the following has been inspired by [Eberly, 2003a], Consider the Taylor 
expansion of a ID analytical function around x. 

f(x + h) = f(x) + hf'[x) + (D(h 2 ). 

Using this, we may estimate the derivative in x as 

rw = /( * +h >- /w + <m 

which is known as the forward difference approximation. Likewise, if we consider 

f(x - h) = f(x) - hf (x) + 0(h 2 ), (20.19) 


(20.17) 


(20.18) 


© 


© 


© 


© 
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then we get the backward difference approximation, 

= /(*) - /(x - h) + (20.20) 

h 

Finally, subtracting (20.19) from (20.17) and solving for f yields the central difference approximation 

= /(x + h) - ./(X - h) + 0(ft2) (20.21) 

2 h 

In contrast to the forward and backward difference approximations, the error term in the central difference 
approximation is one order higher, since the second-order terms in the Taylor expansions cancel. 

A finite difference is a linear combination of neighboring function values, hence we may view an 
approximation as a filter kernel. For example, the kernel for (20.21) would be ^[1, 0, —1]. This principle 
may be generalized to all linear combinations of neighboring function values. 

An approximation of any derivative of an analytical function by a linear combination of neighboring 
function values will now be derived. The problem is stated as follows: find the linear set of coefficients, 
C{, which approximates a d order derivative to p order accuracy as 


d\ 


f [d \x) = p <*/(* + ih ^> + 


( 20 . 22 ) 


^—^min 


using neighboring values of / with i max > i rnm . 

To begin, we rearrange terms for mathematical convenience 


h°_ 

~d\ 


f (d \x) + 0{h d+p ) = £ af(x + ih), 


(20.23) 


and insert the infinite Taylor series for /, i.e. f(x + ih) = o ^tt ~f^ n \ x ), on the right-hand side of 
(20.23). This gives 


id °° Atiun 

-fW( X )+o(h d+ n= £ Cij2 


i —2 m i n —0 


n\ 


(20.24) 


Truncating the infinite series to d + p order accuracy gives 

id (max •nun 

-fW( X )+0(h d+P )= £ Ci £ —fW( X )+0(h d +*), 


n =0 


(20.25) 


and with a rearrangement of the terms we find that 


d~\~P 1 / im ax 

p d \x)+om= y. E in ‘ 

n= 0 \*=*min 


d\h n 

n\h d 


f^ n \x) + 0(h p ). 


(20.26) 


© 


© 


© 
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The solution to this system is to choose c = {c,}, such that 


*max 



*=^min 



if n = d, 
otherwise, 


or expressed as a matrix equation. 


Ic = 


1 

1 m i n 

4 2 . 

min 


i 

^min+l 

? 2 

'min+1 


■d+p-1 -d+p-1 

L'min 'min+1 





'O' 

1 


Qmin 



^max 

.o 


Qmin+1 


0 

7 ^ 

‘'max 


Qmin+2 

= 

i 





0 

■d+p-1 

‘'max 


Qmax 


.0. 


(20.27) 


(20.28) 


where I = {i n }. If the extend of the linear approximation is chosen such that i max — i m j n + 1 = d + p, 
then I in (20.27) is a quadratic (d+p) x (d+p) Vandermonde matrix , and the p + d unknown coefficients, 
c+ may be shown to have a unique solution. In that case, choosing /' mm = 0 yields forward difference 
approximations, i max = 0 yields backward difference approximations, and choosing i m i n = — i max results 
in the central difference approximations. 

To conclude, the cPth order derivative of an analytical function may be approximated to p’th order 
accuracy as a linear function of the neighboring values using 


ti ‘max 

f {d) (x ) = — d Y, c if( x + ih) + (20.29) 

i —^min 

where the coefficients <+ are solved according to (20.27). 


© 


© 


© 


© 
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Calculus of Variation 


For a one-dimensional function, f(x), we may seek the extremal points, i.e., the points, x, where the 
function is either maximal or minimal. If the function is continuous and has a well-defined first-order 

df(x) 

derivative, the extremal points may be found as dj , = 0. Similarly, calculus of variation is concerned 
with finding extremal functions of functionals, i.e., problems on the form 

L(f)= f C dx, (21.1a) 

J a 

/ = argmaxL(/), (21.1b) 

where C is integrable over the domain x £ [a, b] and is a function of x, /, and derivatives of / w.r.t. x. 
Minimizing problems are also covered by the above formulation since max j f —C dx is the minimum 
of (21.1). In calculus of variation, the extremal points arc also called stationary points. On a historic 
note, a fundamental result of calculus of variation arc the Euler differential equations first described by 
Euler (Leonhard Euler 1707-1783) in 1744 [Euler, 1744] and later improved by Lagrange (Joseph-Louis 
Lagrange 1736-1813) in 1755-1762 [Lagrange, 1762], The Euler-Lagrange equations for various func¬ 
tionals will be derived in this chapter. Below we give a short review of [Hansen, 1988]. 

21.1 Deriving the Euler-Lagrange Equation 

In the following we will derive the simplest of the Euler-Lagrange equations for integrals on the form 
(21.1), where 

C = c(x, f(x), = £ (x, f(x), f x (x )), (21.2) 

and where f(x) and f x (x) = arc functions depending on the independent variable x. The function 

/ is often referred to as the dependent variable. 

To find extremal values of (21.1), we will introduce a Taylor series of functionals as 

L(f + A/) = L(f) + AfSL(f) + ..., (21.3) 

where 5L(f) is the first variation of the functional. Unfortunately, in contrast to the Taylor series devel¬ 
oped in Chapter 20, the series in (21.3) depends on / and A/, which arc functions rather than variables. 
An analytical function may be represented as a vector of samples [/i, /h, • • •, f?i] T , which in the limit of 
infinitely densely sampling is equivalent to the continuous function /. Each sample would in such a case, 
be a variable, which would imply that / had infinitely many variables. Luckily, the problem can be solved 
essentially as a one-dimensional problem. 
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Assume that we arc given a function f(x'), which makes the integral extremal; then the first variation 
is zero, that is, 

5L(J(x)) = 0. (21.4) 

Introducing a small arbitrary constant, e, and an arbitrary function, ?/(x), for which 


V(a) = v(b) = 0 , 

allows us to write the variation of / in the // direction as 


and as a consequence 


f(x) = f(x)+£T](x), 


fx(x) = f x (x)+£r} x (x). 


(21.5) 


( 21 . 6 ) 


(21.7) 


For an arbitrary but fixed r/, we have converted the infinite-dimensional problem into a one-dimensional 
problem, i.e., we can write (21.4) as [Hansen, 1988, Equation 1.3 in Section 1.7], 


SL(f(x))= -^L(f(x)+£r](x)) =0. 

l d£ J £=0 


( 21 . 8 ) 


Assuming that a and b are independent of e, and using Leibnitz’s rule for differentiation of integrals 
[Spiegel, 1968, p. 95 Equation 15.14], the differentiation may be moved inside the integral as 


SL (f( x )) = -jz J C ( x > /(- T )> fx{ x )) dx 

= J (x , f(x), f x (x)) dx 

r b dCdx d£df_ d£ df x 

J a dx ds df de dfx de x 

f b dc df dc dfx 

J a df de df x de X ’ 


(21.9a) 


(21.9b) 


(21.9c) 


(21.9d) 


where the last simplification uses = 0. Recalling the rule for integration by parts 


L ^ dx =|fei ‘ - L t xdx - 


we can rewrite the last terms in (2E9d) as 

f b dC df x 

J a dfx de X 


f b dc ( d df\ , 

a dfx \dx de) X 
Q£Qfd> _ rb /d_dC \ df_ 

df x de _ a J a y dx dfx ) de 


( 21 . 10 ) 


(2E1 la) 


(21.11b) 
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Inserting first (21.6) and then (21.5) into (21.1 lb) we find that 


rb dc df x , r dc , a h r b ( d dc 


df x de dx [df a 


"Hr/ HU) ” {x)dx 


b ( d dc 

\ dx df x 


rj(x)dx. 


(21.12a) 


(21.12b) 


Inserting (21.12b) and (21.6) in (21.9d) and collecting terms for rj(x) = H/H- we find that 


rb \dC d f dC\ 1 . . , 

, m --e [m )\ ,ix)ix =°- 


(21.13) 


Since y{x) was chosen as an arbitrary function for which //(a) = rj(b) = 0, we must conclude that this is 
only possible when 

dC d f 8C\ 

~df~dx \df^J m ' } 

To conclude, the function / is a solution to (21.4), if it is a solution to (21.14). 

The variational derivative [Goldstein, 1980, Equation 13.63] is the direction in the function space for 
where L in (21.1) increases the most, and it is found as the difference between the right-hand and left-hand 
side of (21.14), 


5_ _dC_d_ f d£ 

5f Of dx \df x 


(21.15) 


21.2 Equation for Many Independent and High-Order Derivatives of 1 De¬ 
pendent Variable 


For more independent variables such as x, y, z, and t, where 

r r ( , ,/ ,x df{x,y,z,t ) df{x,y,z,t) df(x,y,z,t ) df(x,y,z,t) 

C= C I x,y,z,t,f(x,y,z,t), - — -,- — -,- — -,- — - 


C = C \ x,y,z,t,f(x,y,z,t), -—-,- — -,- — -,- — - I (21.16a) 

= £ (x, y, z, t, f(x, y, z, t ), f x (x, y, z, t), f y (x, y, z, t ), f z (x, y, z, t ), f t (x, y, z, t)), (21.16b) 

the derivation of (21.14) yields the following simple extension [Hansen, 1988, Equation 5 in Section 2.5], 


dC d ( dC\ d ( dC\ d ( dC\ d (dC 


df dx \df x J dy \df y J dz \df z J dt \df t 

In this case, the variational derivative is given as 

A £ = (JL _ y A £ 

5f [df V dxidf Xi ’ 


(21.17) 


(21.18) 
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where x, arc the independent variables x, y. z..... 

We will now derive the Euler-Lagrange equations for the case where C in (21.4) depends on higher 
order derivatives, for example, 


C = £(x,y,f(x,y),f x (x,y)J y (x,y) 

J fxx (x,y) ’ fxy( X ! y)i fyy (x,y)), (21.19) 

where f xy = etc. 

To simplify, we introduce the following shorthand notation: assume that there arc n independent 
variables, then let f Xi = denote all possible first-order derivatives, and let f XiXj = denote all 

possible different second-order derivatives. We may now write 

C C(x i .f.f,,.f XiX .). (21.20) 


Again we assume that / is a solution and pick an arbitrary constant £ and some unknown function ?/, 
which is zero at the border of our domain, and examine (21.8). By straightforward differentiation we get 


! l(/w) = 


rbi 

rb„ 

/ a\ 

J an 


dC, df d£ df Xt v'' dT dfxn 


—iL i \" 

F! t Ar + Z_^ 


+ EE 


d f de jr{df Xi de ^ f-r »f x . x; de 


dx\---dx n . (21.21) 


The first-order terms may be independently integrated by parts and the fc’th term is rewritten as 


fbl 


dc df Xk 
df Xk de 


dx i • • • dx n = — 


r*6i 


d dC\df 

dx\ • • • dx n . 


dx k df x ,J de 


( 21 . 22 ) 


since r)(x i,--- ,a k ,--- ,x n ) = rj(x lr ■■ ,b k ,--- ,x n )) = 0 Vaq,--- , x k+u ■ ■ ■ ,x n . Likewise, 

the integration by parts for the h < fc’th second derivative term is rewritten as 


rbi 


/ a\ 


df: 


a n ^Jx h x k 


9C df XhXk 
de 


dx i • • • dx n = — 


rbi 


' a\ 


d dc \ df Xk 
dx h df XhXk J de 


dx i • • • dx n (21.23a) 


'ai 


b1 f bn ( d 2 dC \df , 

/ 3—3—777- t— dxi---dx n , ( 21 . 23 b) 

Jan \dXfrdXk OJx h x k J OS 


where we repeatedly have used the zero border conditions of //. Gathering all terms yields the Euler- 
Lagrange equation: 

dc ^ d dC A A d 2 dc 


df 


E ll ujl, \ a \ a 

f f) f / 


dxi df Xi ^ ^ dxjdxi df XiXj 


= 0 


(21.24) 


and the variational derivative becomes 


Sf 


| d d d -y -y d? d 

df 2 ^ dxi df Xi + 2 ^ “ dxjdxi df XiXj 


C. 


(21.25) 
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21.3 Equation for Many Dependent Variables 

In the situation where there arc several dependent variables, such as f. g ,.... the integration is over 
functions on the following form. 

C = C (x,f(x),f x (x),f xx (x),...,g(x),g x (x),g xx (x %....). (21.26) 

To evaluate the variation we introduce as many extra variables as dependent variables 

f{x) = f(x) + er](x), (21.27a) 

g(x) = g(x) + 7 £(x), (21.27b) 


for arbitrary functions //, £, ... being zero on the boundaries. The integral is extremal, when the variation 
w.r.t. /. g.... is zero. That is, we seek to solve the following system of equations: 


$f L (f(x),g(x),---) = 

5 g L(f(x),g(x),...) = 


-^L(f(x)+eg(x),g{x),...) 
J^ L (f(x), g{x) + 7 £(x), ...) 


£—0 


= 0, 

(21.28a) 

= 0, 

(21.28b) 


J 7=0 


where df is the variation with respect to /, etc. 

Each equation leads to independent differential equations as derived above, that is, the general solution 
to the variational problem up to second order may be reformulated as solving the following system of 
differential equations: 


dC 

df M 


E 


d dC 


dxi of( m ) dx.jdxi or( m ) 

1 OJ Xi j =1 j=i J 1 oj XiX ■ 


d 2 dC 


= 0, 


where /' m / is the dependent variable and x/,. is the independent variable. 
The variational derivative is naturally given as 


8 

8f 


C = 


r <5 -i 


Ijm 


5/(2) 

c = 

<5 


L( 5 /( m ) J 



dC 

d dC 

OfW 

^ dxi df © 

dC 

d dC 

djm 

dXi ffji 21 

dC 

d dc 

df(M) 

dxi Qf( M ^ 


I spn V n d 2 dC 

' Z^i=r 2—/j=i dxjdxi r) 

I o! 2 dC 

' l^i=l 2~/j=i dxjdxi a j -( 2 ) 


i sr^n v ' n 
' 1 2-^j=i 


df% Xj 


a9 dC 


jj=% dxjdxi gf( M \ 

where / = [/ (1) , / (2) , • ■ •, / (M) ] T is a vector of M dependent variables. 


(21.29) 


(21.30) 
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Basic Classical Mechanics 


An understanding of basic physics is required to fully understand the physically based modeling that 
is applied in physics-based animation. In this chapter we briefly present the basic physics required for 
reading this book. 

The first four sections of the chapter arc mostly concerned with the basic theory of classical rigid body 
mechanics. In Section 22.1 we present the equations of motion of rigid bodies. Thereafter, in Section 22.2 
we discuss friction, and in Section 22.3 we consider work and energy. Having presented the reader with the 
theory of rigid bodies in Section 22.4, we will present the basic theory of the harmonic oscillator, which is 
important for understanding mass-spring systems. Following this, we will introduce the Lagrange formu¬ 
lation in Section 22.5, which may be used when modeling deformable object in physics-based animation. 
In Section 22.6 we treat the principle of virtual work, which is often used when considering constraint 
forces. Finally, we close the chapter with an introduction to stress and strain in Section 22.7, which arc 
used to describe the physical properties of elastic objects. 


22.1 The Newton-Euler Equations of Motion 

In the following, we will introduce the reader to the equations of motion, also known as the Newton-Euler 
equations. The theory found here is based on [Kleppner et ah, 1978, Goldstein et al., 2002]. 

22.1.1 The Trajectory of a Particle 

A particle is defined as a point with mass, but no volume. Hence, a particle is described by its mass, m, 
and its trajectory, r(t), giving the particle position as a function of time, as illustrated in Figure 22.1. The 
velocity of the particle is defined as the limit of the difference between two succeeding positions as the 
time-step goes to zero. 


. . r(t + At) — r(t) .. A r(t) dr(t ) ... 

v(t) = lim -A- -L -— = i im —AA = —LT = r f t ) 

Ai^O At At^O At dt 


( 22 . 1 ) 


which is depicted in Figure 22.2. That is, the velocity is the time-derivative of the position. Similarly, the 
acceleration is defined as the limit of the change in velocity, 

. . .. v(t + At) — v(t) dv(t ) ....... .. . 

aW = iS, At = ~dT = <22 ' 2) 


as illustrated in Figure 22.3. That is, the acceleration is the double time-derivative of the position. 
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22.1.2 Newton’s Laws of Motion 

For pedagogical reasons, we will begin by formulating Newton’s law of motion for particle motion. New¬ 
ton ’s first law of motion states in words that 

• Inertial systems exist and an inertial system moves uniformly. 

• An isolated particle moves uniformly in an inertial system. 

In computer animation, Newton’s first law of motion is used to define a global working frame called the 
world coordinate system (WCS). The world coordinate system should be thought of as an ideal inertial 
system, implying that it moves with zero velocity. Of course, in the real world such a coordinate system 
cannot be uniquely determined, since only relative motion can be measured. Nevertheless, this ideal world 
coordinate system is useful in computer animation because it provides a global frame of reference in which 
the motion of all objects in the world may be described uniquely. 

Newton’s second law of motion defines the linear momentum, p, of a particle as 


p = mv, 


(22.3) 


where m is the mass of the particle and v the velocity. Furthermore, Newton’s second law states that a 
force, F, has both magnitude and direction. The effect of the force is a change in linear momentum. 


F(t) = = FNd = JNfi = mo(t) , 


dt 


dt 


dt 


(22.4) 


from which we have the relationship force equals mass times acceleration. Note that this is not a definition 
of force, but rather it describes the effect of a force as an acceleration. 

The point of action denotes the actual physical point where a force is applied. For a particle, the point 
of action is simply the position of the particle. The point of action becomes very important when we 
consider rotational motion later on. 

Newton’s second law of motion also states that forces obey the superposition principle. That is, if all 
forces, Fj, acting on a particle are known, then the total effect they have on the particle is simply their 
sum, 

Ftot© = ^2 Fi ^ = m ^2 a ©) = ma tot(/)• (22.5) 

i i 

Newton’s third law states that reaction equals action, and that all forces are due to real-world interac¬ 
tions. The first paid of this law comes in two flavors, a weak form and a strong form. The weak law states 
that 

Fab = —Fba, ( 22 . 6 ) 

where a particle A is affected by the force Fab coming from another particle If and the force Fba is the 
force that particle A affects particle B with. The strong law requires that Fab and Fba must be central 
forces, meaning that they must lie on the same line of action, that is, on the line between the two points of 
action. The two flavors are illustrated in Figure 22.4. 
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The Weak Law The Strong Law 

Particle A Particle A 





Particle B 


Particle B 


Figure 22.4: The weak and strong form of Newton’s third law. 


22.1.3 A Particle System 

Animating a single particle is trivial, so in the following, Newton’s law of motion will be extended to 
include a system of particles. Given iV-particles, Newton’s second law for the entire system is given as 

Ff = miaf (22.7a) 

Ff = rn 2 af (22.7b) 

Ff = rn N af. (22.7c) 

The forces can now be split into external, Ff 1 . and internal forces, Ff. External forces come from 
interactions from outside the particle system and internal forces arc due to interactions inside the particle 
system, that is, interactions in between the particles themselves. Splitting the forces gives 

Ff = F ext + F int = mia tot (22.8a) 

Ff = Ff + Ff = m 2 af (22.8b) 

Ff = Ff + Ff = m N af. (22.8c) 

Since internal forces arc due to other particles, it is found that the total internal force acting on particle i 
is given by the summation 

Ff = ]T Fji, (22.9) 
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where F r j is the force with which the y’th particle affects the f th particle. The sum over all particles may 
thus be written as _ 

Ft ot = E F * xt +^ int = E F * xt + E E F n- ( 22 - 1Q ) 

i i i j,i^j 


However, the weak form of Newton’s third law states that 

Fji = ~Fji, 

implying that all internal forces cancel each other, that is, 

EE f c»- 

* j,i¥=3 


(22.11) 


( 22 . 12 ) 


This means that the total effect on the entire particle system is given entirely by the sum of the external 
forces. Hence, we have 


F lol = E FT 1 

i 

= E {maf). 


(22.13a) 


(22.13b) 


The total mass, M , of the particle system is given by 


M = E m i- 


(22.14) 


We define a cm as the mass weighted mean of particle accelerations 


(22.15) 


This is called the acceleration of the center of mass (CM) position. Exactly what this center of mass 
position is, will become clearer later. However, we can now write Newton’s second law for a system of 
particles as 

Ft , ot = Ma cm . (22.16) 

The implication is that the motion of an entire particle system, regardless of the number of particles, may 
be described as a single particle with mass M. But what is the position of this special particle? Since 
the acceleration is the double time derivative of a position, a cm = r cm , we can find the center of mass 
position, r cm , by integrating the acceleration twice with respect to time. 

By the superposition principle (Newton’s second law) and the definition of total mass we have 


(22.17) 
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Integrating twice with respect to time (and setting integration constants equal to zero), we find that the 
position of the center of mass (CM) is 


r C m = ( m * r *) • (22.18) 

i 

That is, the particle system moves like a single particle with position r cm and mass M. 

The Newtonian physics for a real-world object may be derived from a particle system by thinking of 
a real-world object as a collection of infinitely many particles. Letting the number of particles, N, go to 
infinity 

1 1 f 

r cm = lim — (miVi) = — / r(t) dm. (22.19) 

N^oc M ^ v ’ M J K ’ 

i 

and introducing the notion of the mass density p = dm/dV allows us to rewrite into a volume integral, 

rCm= If jy PrdV ' (22 ' 20) 

For rigid bodies, the mass density, p, is usually assumed to be constant throughout the entire rigid body, 
implying that the total mass of the rigid body is given by M = pV. 

It is thus concluded that the center of mass of an object behaves as a single particle with position r cm 
and mass M. In the above, it has implicitly been assumed that all particles have a fixed relative position 
w.r.t. each other, and that the particles constitute a rigid body. For simulation purposes, this means that the 
motion of a rigid body may be found by 


d 2 r cm _ _F 
dt 2 '' M' 


( 22 . 21 ) 


where F is the sum of all external forces. This is a second-order ordinary differential equation (ODE), 
which is easily solved as a system of coupled first-order differential equations, 

(22.22a) 

(22.22b) 

A thorough discussion on ordinary differential equations is found in Section 23.1. 


dv c 


dt 


F 

M 


dr c 


dt 


= Ve¬ 


il.\ A The Body Frame 

A brief summary of the previous sections is as follows: 

• The world coordinate system (WCS) may be derived from Newton’s first law. 

• The center of mass (CM) is found using Newton’s second law on a particle system. 
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From these two results, we know how to describe the motion of the center of mass. However, rigid bodies 
moving in 3D space not only change position, they also change orientation. In order to describe the 
rotational movement, a coordinate frame is placed with its origin at center of mass and its axes fixed w.r.t. 
the orientation of the rigid body. This is known as the body frame ( BF ) and is illustrated in Figure 22.5. 

One possible way to describe the orientation of the body frame is by Euler angles, which can be 
written in vector form as 

e = e x i + o y j + e z k, ( 22 . 23 ) 

where i, j, and k arc the world coordinate axes, and 9 X , 9 y , and 9 Z arc scalars. The Euler angles determine 
the rotation of BF relative to WCS, as depicted in Figure 22.6. Unfortunately, the rotation of objects using 
Euler angles is not commutative. That is, permuting the order of rotations results in different rotations, as 
illustrated in Figure 22.7. The X — Y — Z order is commonly called fixed axis angles or roll, pitch, and 
yaw angles. 

Although the Euler angles do not commute, their time derivatives arc simple to formulate and do 
commute. These arc known as the angular velocity. 


d9 x 

u> = ——i + 


dt 


d9 y 

dt 


j + 


d£z 

dt 


k. 


(22.24) 


The direction of the angular velocity gives the rotation axis and the length is the rotation speed. 

The absolute value of each component describes how fast the object spins around the corresponding 
axis, and the sign indicates the direction of rotation around the axis. Positive means counterclockwise and 
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wcs 


Figure 22.6: The Euler angles describe the orientation of a rigid body. 


minus means clockwise. Furthermore, the angular velocity obeys the supeiposition principle. 

To familiarize ourselves with the angular velocity it helps to look at the simple case where 9 y = 0 and 
9 Z = 0 and 9 X = ct, for some positive constant c. In this case, we derive 


dct . dO . d0 , 

U) = ~A %+ A 3 + A k = 


(22.25) 


We see that the angular velocity is a vector along the x-axis, which is also the axis of rotation, the length 
of the angular velocity is c, which is the rotational speed (traveling distance divided by traveling time). 
By setting 9 Z = ct, we see that we also get a rotation around the z-axis, and the angular velocity now lies 
in the x-z plane indicating both a rotation around the x and z axes, as expected. 


22.1.5 Velocity in a Particle System 

Knowing the center of mass position, r cm , of a particle system, then the position, r t , of the i’th particle 
can be written as 


Ti — T r cm , 


(22.26) 














“book” — 2005/9/30 — 15:44 — page 655 — #667 


22.1 PHYSICS-BASED ANIMATION 655 



(b) Z-X order 


Figure 22.7: Rotation using Euler angles does not commute. 

where Ar * is a body-fixed vector, denoting the i’th particle position relative to the center of mass position, 
as illustrated in Figure 22.8. By differentiation (22.26) w.r.t. time, the velocity of the fth particle is 

— Avi T v cm — lo x Avi -|-"ncm* (22.27) 

'- „ -' 

Ar; 

Observe that the time derivative of the body-fixed vector At*,; is obtained by the cross product with the 
angular velocity. 

This relation between the time-derivative of a body-fixed vector and the angular velocity will be proven 
in the following: to ease notation, the particle position in the body frame will be denoted s instead of Ar j. 
The vector, s, is rotating around a rotation axis, and the tip of the s-vector traces out a circular path. Let 
the unit vector n denote the rotation axis, and Ad the rotation angle around n. Since the rotation axis 
passes through the origin of the body frame, the angle, 6, between s and n is constant. The circular path 
of the tip of s has radius ||s|| sin (A), and after some small time-step At. the displacement As of s will 
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WCS 



Figure 22.8: The position of the i’th particle in a particle system. 


be 


As = s(t + At) — s(t). 


The magnitude of the displacement must then be 


|As|| 2 =2 


| s 11 o sin(0) sin 



(22.28) 


(22.29) 


which follows from trigonometry as can be seen in Figure 22.9. The small angle approximation. 
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axis of rotation 


s(t) \^phi I ^ /s(t+dt) 



l|s||sin(phi)^ ; \ 

/ theta/2~i 


||s||sin(phi)sin(theta/2) 


Figure 22.9: The angular velocity. 


is good, when Ad is very small, and hence 


As|| 2 « ||s|| 2 sin(0)A0. 


(22.31) 


If Ad occurs in time At. then 


Taking the limit as At —> 0, gives 


A«||„ n n ■ 

At " " 2 At 


(22.32) 


ds . Ad . dd 

— = lim s sm o -— = s smlffl) —. 

dt At^o" " 2 At " " 2 dt 
2 


(22.33) 


By definition u> = n4|, and thus 


cj = n —, 

1,2 ILhdt 


(22.34) 


By substitution, it is found that 


ds ii n • rjo n n d0 

— = s sin up n —. 

dt 11 " 2 " 2 dt 

2 


(22.35) 


From the right-hand side it is seen that this is the magnitude of x s. 


- = UJ X s . 

dt 11 " 2 

2 


(22.36) 


This indicates that either 


ds ds 

— = u x s or — = sxw. 
dt dt 


(22.37) 
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However, to ensure that the direction of these vectors arc sensible, ^ must be tangential to the circular 
path and pointing in the direction of rotation. Using the right-hand rule, it is seen from Figure 22.9 that 
u> x s points in the same direction as the rotation. Therefore, we have 

ds 

— = ux s. (22.38) 

dt 

It has thus been proven that 

d/\r ■ 

A ri = —- 1 = u x Arq. (22.39) 

dt 


22.1.6 Euler’s Equation 

The angular momentum of a particle, L, is defined as 

L = r x p, (22.40) 

where r is the position of the particle, and p = rnv is the linear momentum of the particle. The torque, 
r, of a particle is given by 

r = r x F , (22.41) 

where F is the force acting on the particle. It is important to remember which coordinate frame L and r 
are given with respect to, since they have different values in different coordinate frames. The change of 
the angular momentum is given by its derivative, 


dL 

dt 

'ai 

l* 

-ts 

II 

(22.42a) 


dr dp 

= ii xp+ry ii 

(22.42b) 


= v x p+r x F 

(22.42c) 


0 



= r. 

(22.42d) 


The above relation between angular momentum and torque is known as Euler’s equation. In words, Euler’s 
equation is a rotational version of Newton’s second law. 

Euler’s equation is the stalling point for deriving the formulas for the rotational motion. By the 
definition of angular momentum and Newton’s second law, the total angular momentum, L to t , for a particle 
system is given as 

1/tot = 'y " Li (22.43a) 

i 

= Y, ( r * x Pi) (22.43b) 

i 

= x • 


(22.43c) 



“book” — 2005/9/30 — 15:44 — page 659 — #671 


22.1 PHYSICS-BASED ANIMATION 


659 


Using ry = Ar, + r cm , and ry = Ary + r cm . the above may be rewritten as 

L to t = y ((Ar ; + r cm ) x m; (Ar* + r cm )) • (22.44) 

i 


Expanding the cross product and rewriting the summation we find four terms: 



(22.45) 


Although seemingly complicated, this is a useful form for further analysis. To simplify, first use Ary = 
r — r cm , which implies that 


y, rtii Ar,: = y m t ; (ry - r cm ) 

i i 

(22.46a) 

= y m^ry - y nrir cm 

i i 

(22.46b) 

= y mj/y - r cm y m; 

(22.46c) 

Since M = mi, and r cm = J2 m i r i > we find that 


y m. Ary = y '/ryry - M r cm 

(22.47a) 

= Mr cm - Mr cm 

(22.47b) 

= 0. 

(22.47c) 

Furthermore, since the derivative of a constant is zero, it is found that 


y miAr.i = 0. 

(22.48) 


i 


Using these facts about m t Ary and myAry in the equation for the angular momentum, it is found 
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£tot — E (A r.i x iiijArj) + E rriiAri J xr 


X j ^ ^ TfljV cm + 


^2 m.Ar, 


(22.49a) 


^ (Ar* x miAr-i) + r cm x Mv c 


In a si mi lar way, the total torque, r to t, of a particle system is found to be 


Ttot = Ti = ^ ( r * X F i) = . . 

i i 

= (A n x Fi) + r cm x F. 


By definition, Euler’s equation for a particle system is 


T tot — 


(22.49b) 


(22.50a) 


(22.50b) 


(22.51a) 


^ ^ {At' i x Fi) H- 7*0111 x F — ( ^ ^ {Av-i x urtiAvi) + 7*0111 x Mv, 


Differentiating the right-hand side with respect to time gives 

yy (A Ti x Fi) + r cm x F = yy (Arq x rriiAri 


d „, d ,,, . 

4“ cm xil Iv cm + x — \Mv cm ) 


(22.51b) 


(22.52a) 


E< A r.j x rriiAri) + cm X F. 


(22.52b) 


Observe that we have a r cm x F term on both the left- and right-hand sides, which cancels each other. 
Implying that Euler’s equation for a particle system reduces to 


yy (Ar,; x Fi) = — yy (Ary x rriiAri 


(22.53) 


'T" cm 
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Surprisingly enough, all terms referring to the motion of the center of mass have disappeared, and the 
remaining terms describe the motion of the particle system around the center of mass. It is concluded 
that that linear and rotational motion may be treated independently. The equations for L cm and r cm may 
be conceptually simplified by introducing the concept of the inertia tensor. Briefly explained, the inertia 
tensor is the rotational equivalent of the total mass. It tells how mass is distributed in an object and thus 
gives information about how hard it is to rotate an object around any given axis. In the following, we will 
derive the inertia tensor. Consider the angular momentum of particle system w.r.t. to the center of mass, 


L cm — ^ ' (At*j x m-iArij) 

i 

= ^2 x m i (<*> X A Vi)) 

i 

= ^2,mi (A Vi x (u) x A Vi)) 


(22.54a) 

(22.54b) 

(22.54c) 


Using A Ti = [xi, i/i, Zi ] T , this can be written as 


mjArq x (u; x Arq 


I" {Vi + Z i) Ux - XiViUy - XiZiLU z 

yy rrii -XiViLOx + (xf + zf) ut y - yiZiU z 
\_-XiZiU x - ViZiLOy + (xf + yf) u z _ 

Ei (yf + z i) -T,i m i( x iyi) -E 
- Y,i m ( Xiyi ) Yli m i i x i + z i ) - Ei m i (Vi Z i) 

_ - E i m 'i ( x i z i ) - Ei m 'i (■ Vi z i ) Ei m i (■ x i + Vi) 


(22.55a) 


The matrix I is called the inertia tensor 


Using the inertia tensor, the equation for L cm simplifies to 


(22.55b) 


- Ei m i ( x iVi ) 

“ Ei m ’i ( x i z i) 

\ IXX 

Ixy 

Ixz 


Ei rrii (x? + zf) 

~ Ei m i {Vi z i) 

ii 

lyy 

lyz 

(22.56) 

~ Ei m ’i (: Vi z i ) 

Ei m i ( x f + vf)_ 

| J-zx 

Izy 

Izz 



L cm — I u). 


(22.57) 


It is observed from (22.56) that the inertia tensor is symmetrical, I = I T . The terms on the diagonal arc 
called products of inertia, and the off-diagonal terms arc called moments of inertia. The inertia tensor for 
a solid object is found by letting the number of particles in the system tend to infinity and rewriting the 
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sum as volume integrals, 


J v p(y 2 + z2 ) dv - 

(22.58a) 

J p(x 2 + z 2 ) dV, 

(22.58b) 

J^p(x 2 + y 2 ) dV, 

(22.58c) 

- / p(xy) dv, 

J V 

(22.5 8d) 

— p (xz) dV, 

Jv 

(22.58e) 

- [ p{y z ) dv. 

Jv 

(22.58f) 


Although the inertia tensor, I, conceptually simplifies the equation, the underlying mathematics is still 
complicated, since / is a function of orientation of the object. Fortunately, it can be split it into a sum, and 
for a particle system it may be written as 


[{yf + z i ) - ( x iVi) - ( x i z i ) 

7 = X/ m% ~ ( X i + Z i) ~ (Vi Z i) 

[ - ( XiZi ) - (yiZi) (xj + yf) 

f \ X 1 + Vi + Z 1 0 

= z2 m m 0 x2 i+y 2 i+ z ! 

i \ 0 Ox 


u X7 t Xiyi XiZi 

0 - x t y t yf yiZi 

X 1 + y 2 + z f x i z i Vi z i z i 


(22.59a) 


(22.59b) 


This may be recognized as the inner and outer product of the vector Ar, = [x t y t z,] , and the equation 
can be rewritten as 

I = Ar*jl — Ar,;Arf) , (22.60) 

i 

where 1 is the identity matrix. The term Ar , is the rotation, R, of some fixed vector, p,, in the body 
frame, that is, 

A ri = Rpj. (22.61) 


Substituting this gives 


1 = Y1 m < (( R Pif R p > 1 - R Pi ( R Pi) 

i 

= m (.Pi RT R P> 1 ~ R PiPi RT ) 

i 

= ^ j m i (pjp,l - RPiPi R 1 ) ■ 


(22.62a) 

(22.62b) 


(22.62c) 
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Since p{ p t is a scalar, the equation can be simplified further with a little mathematical trick: 


1 = { R PiPi RTl ~ R PiPf Rl ) 

i 

( \ 


= R 


{PfPi 1 ~ PiPi) 


V 


r t . 


(22.63a) 


(22.63b) 


1 body 


Comparing with (22.60), it is found that the inertia tensor may be expressed as the product of a fixed 
inertia tensor of the body frame and a rotation matrix. The inertia tensor of the body frame is thus given 
as 

-fbody = mi (Pi Pi 1 “ PiPi ) ' (22.64) 

i 

From a computer simulation viewpoint, this is tremendously practical, since I body can be precomputed, 
and I can always be found during simulation using the rotation R of the body frame. Given some thought, 
it should be clear that the inertia tensor on the form I = RI body-/? 7 is a consequence of the symmetry of 
I, since any symmetric matrix, A, may be written as (Proposition 18.3), 

A = RDR t , (22.65) 


where D is a diagonal matrix, with the eigenvalues of A along its diagonal, and R is an orthonormal 
matrix, where the columns arc the eigenvectors of A. This indicates a clever initial orientation of the 
body, where the body frame inertia tensor is a diagonal matrix, and R is then the rotation of this initial 
body frame. Furthermore, if the columns of the rotation matrix arc 

R= [ i j k ] , (22.66) 

then the columns arc the instantaneous placement of the initial body frame’s x-, y-, and z-axis. Since 
these axes arc fixed body vectors, their time derivatives can be found simply by taking the cross product 
with the angular velocity; the time derivative of the rotation matrix is thus, 

R = [ u) x i u) x j uj x k ] . (22.67) 

Using the cross product matrix notation from Proposition 18.4, we have 

R = uj x R. (22.68) 


22.1.7 Integrating over a Solid Object 

There arc three commonly used approaches for computing the mass, the center of mass, and the inertia 
tensor for a solid object. 
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Analytical Integration: In the first method volume integration is computed analytically. Consider an 
axis-aligned cube with side length 2 L and constant density, p. By definition, the I xx element of the inertia 
tensor is found as 


Ixx = P [ ( V 2 + z 2 ) dV 

Jv 


= P 


' —L J-L J-L 


(y 2 + z 2 ) dx dy dz 


16 r5 

= P^rL 5 


The volume of the cube is 

V = (2 Lf = 8 L 3 , 

and therefore it is found that p = ^ = ^-|4. Finally, it is found that 



(22.69a) 

(22.69b) 

(22.69c) 

(22.70) 

(22.71) 


A similar approach may be used for the remaining volume integrals. Unfortunately, the analytical solution 
is only simple for objects with very simple shape. 


Decomposition: The second approach decomposes an object into simpler objects, for instance, small 
cubes, voxels, as shown in Figure 22.10. The implication is that the analytical approach is tractable for 
cubes, and if all mass properties arc represented w.r.t. the same frame, then the mass properties of the 
simpler volumes may be added in order to find an approximation to the original object. 


T, v > 

(22.72a) 


(22.72b) 


(22.72c) 


(22.72d) 


In order to sum the inertia tensor of each cube, we need to rotate and translate each inertia tensor into the 
same coordinate frame. 

Imagine we have an analytical formula for computing the inertia tensor in frame G, and we need to 
transform the inertia tensor into coordinate frame F. Rotation is simply. 
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Figure 22.10: A solid object may be decomposed into a set of smaller cubes, voxels. 


If frame F is translated by \x,y,z] T w.r.t. frame G, but both F and G have the same orientation, then 
inertia tensor can be computed by the transfer-of-axe theorem [Kleppner et al., 1978], which states that 


lg = lg + M(y 2 + z 2 ) (22.74a) 

Iyy = 1% + M (x 2 + z 2 ) (22.74b) 

IL = % + M (x 2 + y 2 ) {22.1 Ac) 

Ixy = J xy - M (xy) (22.74d) 

lg = lg-M(xz) {22.1 Ac) 

lg = I? y -M(yz). {22.1 Af) 


Dimension Reduction and Value Propagation: Finally, the integration may be performed using di¬ 
mension reduction and value propagation [Mirtich, 1996]. The basic idea is to use Stokes’ and Gauss’s 
theorems to reduce volume integrals to line integrals. This will not be discussed further. 

Comparing the three approaches, the analytical integration is accurate within numerical precision, but 
also tedious to work with, since complex shapes yield integrals that arc difficult to solve analytically by 
hand. The decomposition approach is quite simple, but requires that one can make a decomposition of 
simpler volumes, which approximates the real object. The accuracy depends on how close the approxi¬ 
mation is to the real object. Decompositions into spheres, boxes, and cylinders arc often hand modeled in 
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interactive applications, such as computer games and used for collision detection. These decompositions 
can therefore easily be reused to compute the needed volume integrals. The last approach of dimension 
reduction and value propagation can be applied to any polygonal closed surface mesh, and the method is 
accurate within numerical precision. However, it is not always possible to obtain a closed surface mesh. 
In practice, polygonal meshes often have open boundaries, flipped faces, and many more degeneracies. 


22.1.8 A Summary of Newton’s Equations of Motion 

Newton’s equations of motion for the linear and Euler’s equation of rotational motion of a rigid body have 
been derived in the previous section, and they may be summarized as: 


dv cm 

dt 

d J P cm 


dt 

dR 

dt 

dL cm 

dt 


^cm 

(22.75a) 

F 

(22.75b) 

u x R 

(22.75c) 

T cm- 

(22.75d) 


The first two described the linear motion, and the following two describe the rotational motion. These 
four equations are commonly referred to as the Newton-Euler equations of motion. 

The three auxiliary equations arc 


(22.76a) 
(22.76b) 
(22.76c) 

with the mass and center of mass given by 


I 1 = ri~^r t 
CO = - I Lr m 


— 


Pc m 

M ’ 


M = [ pdV ., 

Jv 

r ™ = h! v priv ' 


C 22.77a) 
(22.77b) 


£■ 


© 


© 


© 
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and the inertia tensor by 


Ixx — 

J^p(y 2 + Z 2 ) dV. 

(22.78a) 

II 

--- 

J p (x 2 + z 2 ) dV, 

(22.78b) 

Izz — 

J^p(x 2 + y 2 ) dV, 

(22.78c) 

Ixy — 

- / p(xy) dv, 

Jv 

(22.7 8d) 

Ixz = 

— Ip (xz) dV, 

Jv 

(22.78e) 

Izy = 

- [ p{yz) dv. 

(22.78f) 


'V 


The equations of motion arc usually rewritten in terns of the state function, Y (t) as follows: 


Y{t) = 


f cm 



rim 

R 

and 

dY(t) 

u x R 

Pcm 

dt 

F 

Ecm 



X cm 


(22.79) 


and using quaternions, which arc described in detail in Section 18.5). Then Proposition 18.46 gives 


R = u x R 


the state function may be written as 


Q = 


1 

2 


[0, u] q, 


Y(t ) 


T cm 



rim 

q 

and 

dY(t) 

\ [0, w] q 

Pc m 

dt 

F 

Ecm 



X cm 


(22.80) 


(22.81) 


This completes the Newtonian description of the unconstrained motion of a rigid body. Knowing the initial 
position and orientation, the motion of the rigid bodies is found through integrations; see Chapter 23. 


22.2 Coulomb’s Friction Law 

A frequently used friction law is Coulomb’s friction law, and it will be discussed here for a single-point 
contact (we refer the reader to [Trinkle et al., 2001] for an example of other friction laws). 

Consider two rigid bodies touching at a single point of contact, as illustrated in Figure 22.11. At the 
point of contact the surfaces of the two rigid bodies have parallel surface normals. The direction of the 
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surface normals identifies a unit contact normal vector, n. The point of contact and the contact normal 
describes a contact plane orthogonal to the contact normal. A unit vector lying in this contact plane is 
denoted by t and referred to as the tangential direction. A force acting at the contact point is known as 
a contact force, F contact , and is decomposed into two components: a normal force, F n , parallel to the 
contact normal direction, and a friction force, F t , lying in the contact plane. 

-^contact = F n + Ft- (22.82) 

The normal force is equal in magnitude and opposite to the weight of the objects in the normal direction. 
If the relative velocity at the point of contact by the vector, u, then the tangential component. 


u t = u — (u ■ n) n, 


(22.83) 


describes the amount and direction of sliding between the two rigid bodies. 

Coulomb’s friction law relates the magnitude of the normal force to the magnitude of the friction force 
through a coefficient of friction, p. The actual relationship depends on the state of contact, and there is a 
total of three different states to distinguish between: 

• Dynamic (sliding) friction 

• Stable static (dry) friction 

• Unstable static (dry) friction 


Dynamic (sliding) friction occurs when the two bodies move relative to each other, indicating that the 
tangential relative velocity is nonzero. In this case, the friction force should attain its maximum possible 
magnitude, and directly oppose the relative movement. That is, 


Ut 0 =y- Ft — p 




(22.84) 


Stable static (dry) friction occurs when the two bodies do not move relatively to each other, meaning that 
the relative tangential velocity is zero. In the stable case this relation is maintained, indicating that the 
relative tangential acceleration is also zero. When these conditions occur Coulomb’s friction law states 
that the magnitude of the friction force is less than or equal to the magnitude of the normal force multiplied 
by the friction coefficient. However, no constraints arc placed on the direction of the friction force. 


u t = 0 and it t = 0 \\F t \\ 2 < p\\F n \\ 2 . (22.85) 

In unstable static friction, the conditions for this case arc nearly identical to stable static friction; the 
difference is that the relative tangential acceleration is nonzero, indicating that the tangential velocity is 
just at the point of becoming nonzero. The magnitude of friction force is required to be as large as is 
possible in the case of dynamic friction; however, for unstable static friction the direction of the friction 
force is opposing the direction of the relative tangential acceleration. 


u t = 0 and ii t f 0 => \\ F t\\ 2 = b \\F n \\ 2 and F t ■ ii t < 0. (22.86) 


© 


© 


© 
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Figure 22.11: 2D illustration of a friction cone of a ball sliding down an inclined plane. W is the weight 
of the ball. 


Observe however, that it is not required that the friction force directly opposes the direction of relative 
tangential friction. The name unstable indicates that a transition can occur either to dynamic or stable 
static friction. 

An often-used geometric approach for describing the region of legal contact forces that can be applied 
at a contact point is the friction cone. The friction cone is simply the region of space traced out by all 
possible values of 

-^contact = F n + Ft (22.87) 

Figure 22.11 shows a 2D example of a friction cone. The slope of the friction cone is dictated by the 
reciprocal value of the friction coefficient //. For dynamic friction, the contact force is restricted to lie on 
a straight line on the surface of the friction cone, and the straight line lies on the side of the friction cone 
where the relative tangential velocity is pointing away from. For stable static friction, the contact force 
can be anywhere interior or on the surface of the friction cone. For unstable static friction, the contact 
force is restrained from being inside the friction cone, and instead it must be on the half-surface lying in 
the opposite direction of the relative tangential acceleration. 



“book” — 2005/9/30 — 1^:44 — page 670 — 


CHAPTER 22. BASIC CLASSICAL MECHANICS 


22.3 Work and Energy 

Looking at a single particle again, we arc sometimes facing the problem of determining the trajectory of 
the particle given its equation of motion, that is, 


dv 

m— = F(r). 


( 22 . 88 ) 


The problem with this is that the laws of nature often describe forces as functions of position, that is, 
F(r), and not as functions of time, which is what we really want in order to solve the above equation in a 
mathematically convenient way. 

Imagine we knew the starting and ending points on the particle trajectory. We label these A and B, 
and then we can integrate both the left- and right-hand sides of the above equation with respect to position 
and obtain the following derivation 


Now we use the following identity rule 

1 d_ 

2 dt 


x dv 

F (r) = m—, 

(22.89a) 

-) ■ dr = [ m-j- • dr, 

J A dt 

(22.89b) 

f B dv 

= / m— ■ vat. 

J A dt 

(22.89c) 

2 dt v ; ’ 

(22.90a) 

1 (dA A „ dA\ 

~ 2 \ dt ' + '~dt) i 

(22.90b) 

II 

& ^ 

(22.90c) 


which is really just a clever rewriting of the product differentiation rule. Applying the identity to our 
derivation, we have 


rB rB ^ ^ 

J A p (r)-dr = J A Idt, 

1 „ m 2 1 n m2 

= \\ v b \\ 2 - Ipa || 2 ■ 

ii 11 2 

Traditionally, physicists write v = ||v|| ; using this notation we can write: 

1 9 1 

J a F(r) • dr = -mv 2 B - -mv\, 


(22.91a) 

(22.91b) 

(22.92a) 


W BA = K b - K a . 


(22.92b) 
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Here, f f F(r ) • dr is termed the work, Wba, done by force F from A to B\ the terms \mv 2 arc called 
the kinetic energy K. Sometimes the notations K = T = E\, :in arc also used. The simple looking equation 

Wba = K B -K A , (22.93) 

is known as the work-energy theorem. From the above derivations it is seen that the work A IF done by 
the force, F, in a small displacement A r is 


AW = F ■ Ar. 


(22.94) 


Splitting the force into an orthogonal and parallel component along Ar, we immediately see that only 
the component of the force parallel with the path contributes to the work done by the force. Thus, forces 
orthogonal to the path do not contribute to the work. 

If the work integral is not dependent on the actual path between A and B, but only on the positions A 
and B, then the force F is called a conservative force. 

Example 22.1 

,4 a' an example of a conservative force, consider the gravitational field, where F = mg. In this case, 


Usually, g 


[0,0, — g] T , where g 


j'B 


Wba= / F(r) ■ dr, 

(22.95a) 

Ja 


r B 


= / mg ■ dr. 

(22.95b) 

Ja 


[B 


= mg / -dr, 

(22.95c) 

Ja 


= mg • (r B - r A ). 

(22.95d) 

9.8 m/s 2 , so 


Wba = mg (z A - z B ) ■ 

(22.96) 


In conclusion, the work done by gravity depends only on the starting and ending positions. 
Whenever we have conservative forces we can write the work integral in the form 

rB 


or 


f 

/ F{r ) • dr = function of(r^) — function of(r^ 4 ), 

■J A 

[ F(r) ■ dr = -U(r B ) + U(r A ), 

Ja 


(22.97) 


(22.98) 


where U(r) is a function defined by the above expression and U is called the potential energy. The sign 
convention might appeal - a little awkward at the moment; however, if we look at the work-energy theorem 
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for a conservative force we have: 


Wba = K b - K a 

(22.99a) 



+ U A = K b - K a 

(22.99b) 



+ Ua = Eb + Ub, 

(22.99c) 


from which the chosen sign convention should be obvious. Sometimes the notation V is used instead of 
U. However, we will stick with U in this section. Looking at the left-hand side of the equation we see 
that we only have kinetic and potential energy at A, and the right-hand side only has energies at B, thus 
the sum K + U is conserved. K + U is called the mechanical energy, (sometimes just the energy), and it 
is often denoted by E or L me |<. The conservation of mechanical energy is derived directly from Newton’s 
second law. We say it is a derived law and it is a special case of the more general energy conservation law, 
which states that if we look at all energies involved (heat, light, mass, etc.) then their sum is conserved. 

In our derivation of the conservation of the mechanical energy, we assumed that the work was only 
done by a conservative force. If we have a nonconservative force, we divide the applied force into two 
terms: one equal to the sum of conservative forces, F c, and another equal to the sum of nonconservative 
forces, F nc , 


& 


F = F c + -Fnc- 


The total work done by F from .4 to B is 


Try total 
11 BA 



F(r) ■ dr, 


(Fc + F NC ) • dr, 

Fc + [ F N c ■ dr, 
Ja 


-U b + U A 

-U b + U A 





+ Wi 


NC 
BA • 


( 22 . 100 ) 


(22.101a) 

(22.101b) 

(22.101c) 

(22.101d) 

(22. lOle) 


0 


0 
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The work-energy theorem, If©© = Kb — Ka- now has the form 

- U B + uA + Wba = Kb - Ka , (22.102a) 

Wba = Kb + U b - Ka - U A , (22.102b) 

Wgf = K b + Ub ~ ( K A + U a ), (22.102c) 

E b E a 

W^ = E b -E a . (22.102d) 


We see that the mechanical energy E is no longer constant but depends on the state of the system. A typical 
and often-used example of a nonconservative force is the friction force. The friction force is parallel to 
the path, but points in the opposite direction, so 11)© 11011 < 0. Thus, energy is dissipated from the system 
implying that Eb < Ea- How much Eb decreases depends on how long the particle is moved along the 
path. 

Let’s turn our attention back to the definition of potential energy, 

U B -U A = - f F-dr. (22.103) 

J A 

Consider the change in potential energy when a particle undergoes the displacement A r, 


U(r + A r) — U(r) 


r+Ar 


F(r') ■ dr'. 


(22.104) 


The left-hand side is the difference in U at the two ends of the path, A U. If A r is so small that F does 
not change significantly over the path, F can be regarded as constant. Thus, we have 


A U = -F ■ Ar, 

= - (F x Ax + FyA-y + F z Az). 


(22.105 a) 
(22.105b) 


In the last step, we have written the right-hand side in terms of Coordinates. Another way to write AU is 


as a first-order differential, 


Arr dU . dU . dU . 
AU ~ -tj-Az + -7—A y + —Az, 
ox ay oz 


(22.106) 


combining the equations we have 


dU . dU A 8U . 

+ ^“ A2: 

ox ay oz 


—F x Ax — F y Ay — F z Az. 


(22.107) 


© 


© 


© 
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In the limit (Ax, A y, A z) —► 0, the approximation becomes exact. Since Ax, Ay, and A 2 arc indepen¬ 
dent, the coefficient on either side of the equation must be equal, meaning 


F 

1 X — 0 > 

OX 

(22.108a) 

f--*L 

V dy ’ 

(22.108b) 

p -JJL 

z dz ‘ 

(22.108c) 

Using the gradient operator, V, these three equations are written as 


F = -VU. 

(22.109) 


Not only does this equation allow us to find the force given the potential energy function, but it is also a 
valuable tool when physicists analyze the stability of a system in terms of energy diagrams. The observa¬ 
tion is that F is pointing toward minima of U. 

Living in the real world, as humans, we are often quite used to measuring things in terms of absolute 
measures; energy is a little tricky in this manner. In fact, the value of E is to a certain extent arbitrary, 
only changes in E have physical significance. Let’s look at the potential energy definition once again. 

U B -U A = - [ F-dr (22.110) 

Ja 

defines only the difference in potential energy between A and B and not potential energy itself. For 
instance, we could add any constant to Ua and the same constant to Ub without violating the above 
definition, but we would change the value of E, since E = K + U. 

In the above, we have introduced the concepts of work, mechanical energy, kinetic energy, potential 
energy, etc. for a single particle. We will now extend the concept of kinetic energy to a particle system, 
which in the limiting case, yields the kinetic energy for a rigid body (the same recipe was used in sec¬ 
tion 22.1). For a particle system, the total kinetic energy is simply given by the sum of the kinetic energy 
of each particle. We have 

K = \ m j v2 j • ( 22 - 11 !) 

3 

From Section 22.1.5 we know that for a particle in a rigid body, 

Vj = v cm + u> x Arj, (22.112) 

'-v- 

Avj 

where the particle position is given by r y = r cm + Ar ? , so A rj is a vector from the center of mass to the 
particle position. Inserting into the kinetic energy, we have 

K = i rrij ( v cm + u> x A r© ■ (v cm + u> x At© , (22.113) 

3 


© 


© 


© 
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with some mathematical manipulation this yields 

K = \ m j (■ v cm • -fern + 2 (u> x A rj) ■ v cm + (<jj x Arj) • (u> x Ar,)), 

3 

= T 2 Mv lm + «cm • X Ar i) + X! X Ar i) • ( W X Ar j) • 

3 3 

= + V cm ■ m 3 Av 3 + X] \ m l ( W X Ar ?) ' X Ar i) ’ 

3 3 

' -v-' 

=0 


(22.114a) 

(22.114b) 

(22.114c) 


= + x Ar ?) ' ( w x Ar i) • (22.114d) 

3 

Using the vector identity {A x B) ■ C = A ■ (B x C), with A = u), B = A rj, and C = x Arj, 
yields 


\ M Vcm + ' ( Ar i X (w X At - j )) , 

j 

(22.115a) 

\ M Vcm + ■ J2 m 3 ( Ar i X (w X A rj)) , 

j 

(22.115b) 

T^cm + • J2 ( Ar i x ( m 3 Av i ))> 

j 

(22.115c) 

' -„-' 

-t'em 


—Mv cm + —u) ■ L cm . 

(22.115d) 


Taking the limit and recalling that L = Iu>, we end up with the following formula for the kinetic energy 
of a rigid body: 

K= l -Mv 2 cm + \yiu. (22.116) 

The first term is due to translational motion only and the second term is due to rotational motion only; 
thus, they are respectively called the translational kinetic energy and the rotational kinetic energy. This 
concludes out treatment of classical rigid body mechanics. 


22.4 The Harmonic Oscillator 

In the preceding sections, we treated the basic theory of classical rigid body mechanics. In this section, we 
will present the basic theory of the harmonic oscillator. A harmonic oscillator is a system occurring re¬ 
peatedly in animation systems and is referred to as springs. First, a single-object system will be discussed 
followed by a two-object system. 
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Figure 22.12: The one-mass harmonic oscillator. 


22.4.1 The One-Object Harmonic Oscillator 

Consider a particle attached to a wall of infinite mass with a spring, as shown in Figure 22.12. The spring 
force given by 

Fspnng = -kx, (22.117) 

is called Hooke’s spring law, and k > 0 is called the spring coefficient. Newton’s second law of motion 
dictates the motion of the particle as 

mx = —kx, (22.118) 

which yields the second-order ordinary differential equation 

mx + kx = 0. (22.119) 


An analytical solution to this equation exists and can be shown to be 

x = B sin (cuo t) + C cos (luq t), 

where 

CJ 0 = 


( 22 . 120 ) 


( 22 . 121 ) 


is known as the natural frequency. The constants B and C can be evaluated from the initial position and 
velocity of the particle. Using the trigonometric identity 


cos (a + b) = cos (a) cos ( b ) — sin (a) sin ( b). 


( 22 . 122 ) 
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Undamped Harmonic Oscillator 



Figure 22.13: The amplitude of an undamped harmonic oscillator plotted as a function of time. 


The analytical solution (22.120) can be rewritten as a single cosine function, 

x = A cos (coot + q i>). (22.123) 

Setting (22.123) equal to (22.120) gives 


A cos (coot) cos ( 4> ) 

— A sin (coot) sin (f) = B sin (coot) + C cos (coot). 

(22.124) 

The implication is that 

A cos (f) = C, 

(22.125a) 


A sin (f) = —B, 

(22.125b) 

and that 

A= \J B 2 + C 2 , 

(22.126) 

and 

,,, B 

tan(<p) = - —. 

(22.127) 


The particle will follow a cosine motion with amplitude A, frequency to o, and phase 0. The motion is 
consequently repetitive with a period T, 



LOq 



(22.128) 


Figure 22.13 shows an example of a harmonic oscillator. 
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The idealized harmonic oscillator assumes that there is no loss in energy over time. This is physically 
unrealistic, and practically not as useful as a damped harmonic oscillator. A typical damping force is 
given by 

-^damping = bx, (22.129) 

which is commonly referred to as a linear viscosity force, and the coefficient 6 > 0 is called the damping 
or viscosity coefficient. From Newton’s second law of motion, the motion of the damped particle must 
obey, 

rrix = — kx — bx, (22.130) 

which is equivalent to the second-order ordinary differential equation 


rrix + bx + kx = 0 , 


(22.131) 


or 


b k 

x 4 - x -|- x = 0 . 

m m 


(22.132) 


Using the natural frequency u>q = — and introducing 7 = —, this may be written as 


... 9 

x + yx + u>qX = 0 . 


(22.133) 


An analytical solution exists and is stated here without proof as 

x = A exp cos (uit + <f >), 

where A and f arc constants determined from initial conditions, and 


w i = \/ w o-^ 


is the frequency. The frequency has real solutions when 

„2 


wo - Y - 0 


m 


4m 2 


> 0 

4 km — b 2 > 0 . 


(22.134) 

(22.135) 

(22.136a) 

(22.136b) 

(22.136c) 


Introducing the notation 


Aft) = A exp 


(22.137) 
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Lightly Damped Harmonic Oscillator 



Figure 22.14: The amplitude of a lightly damped harmonic oscillator plotted as a function of time. 


the damped harmonic oscillator is written as 

x = A(t) cos (u\t + <f >), (22.138) 


which is a cosine wave with a time-varying amplitude. 

Comparing the damped harmonic oscillator (22.134) with the undamped harmonic oscillator (22.123), 
we see that the amplitude of the damped oscillator is exponentially decaying and the frequency co \ of the 
damped oscillator is less than the frequency, ccq, of the undamped oscillator. In Figure 22.14 the amplitude 
of a damped harmonic oscillator is shown. 

The zero-crossings of the damped harmonic oscillator occurs at equal time intervals by the period 

T = —. (22.139) 

ui 


Surprisingly though, the peaks of the motion do not he halfway between the zero-crossings. 

The damped motion may be described qualitatively by examining the value of b 2 — Arnk and the ratio 


of 




(22.140) 


The motion is: 


Over damped, when b 2 — 4rnk > 0. Two cases of over-damping can be distinguished: 

Lightly damped, if 27- <C 1 then A(t) decays very little during the time the cosine makes many 
zero crossings. 

Heavily damped, if is large then A(t) rapidly goes to zero while the cosine makes only a few 
oscillations. 
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Lightly Damped Harmonic Oscillator 


Heavily Damped Harmonic Oscillator 





time (seconds) 


time (seconds) 


Figure 22.15: A qualitative analysis of a damped harmonic oscillator. Amplitudes arc plotted as functions 
of time. 


Critically damped, when b 2 — Arrik = 0. In this case, the amplitude tends rapidly to zero (without 
crossing zero) and flattens out. 

Under damped, when b 2 — Amk < 0. In this case, the amplitude decays even faster than in the case of 
critical damping, but the amplitude crosses zero and increases again before flattening out. 

Examples of the four categories arc illustrated in Figure 22.15. 

Using a harmonic oscillator for a mass-spring system (see Section 8.4.1) to model penalty forces 
(see Chapter 5), reasonable motion is obtained for heavily damped, or at best, critically damped, since 
oscillating forces such as oscillating contact forces arc often undesirable. Furthermore, it is often desirable 
to require that the amplitude should decay to zero within a time-step, since this will make objects appear 
rigid. 


© 


© 
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Figure 22.16: The two-body harmonic oscillator. 


22.4.2 The Two-Object Harmonic Oscillator 

Consider a two-body system, as shown in Figure 22.16, where two particles arc connected with a spring 
and label uniquely by the labels i and j. Hooke’s law for this system is given as the following two 
equations: 

rriiXi = k (.Xj — xi ), (22.141a) 

rrijXj = —k (xj — Xi) , (22.141b) 


where x t is the position of the f th particle and m; the mass. Similarly, Xj and rrij arc the position and 
mass of the j'th particle. Dividing the first equation by rt:, and the second by rrij, yields 


Xi = k— (xj - Xi), 


rrij 


1 


Xj = —k — (.Xj — Xi), 


m 


and subtracting the first from the second yields 


rp . _ rp . - - !/> _ I rp . _ rp . \ _ Ip _ I rp . _ rp . | 

•as J •A-'t - *V yUs j •as % j m yJU j •*s%) 


m 


= -k 


m-i 

1 1 \ , 

- 1 - )[Xj- Xi) . 

nri j rrii J 




(22.142a) 

(22.142b) 

(22.143a) 

(22.143b) 


The two-object harmonic oscillator has the same motion as the one-object harmonic oscillator. This 
may be seen by setting, x = Xj — Xi, in (22.143b), such that 



if 1 1 A 

x = —k -1- x 

\rrij rriij 

(22.144) 

Introducing the reduced mass as 

rriimj 

F = , , 

+ rrij 

(22.145) 

it is discovered that 

fix = —kx. 

(22.146) 


It is thus concluded that the two-object harmonic oscillator behaves as a one-object system of mass fi. 
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22.5 The Lagrange Formulation 


The Lagrange formulation will be discussed in the following. This section is written with an emphasis on 
obtaining a realistic implementation of a physical simulation rather than a complete physical and mathe¬ 
matical understanding. The scope of this section is the derivation of [Terzopoulos et al., 1987, Equation 1], 


shown here: 



dr 8e 


(22.147) 


We will use a notation similar to [Goldstein, 1980]. The physical theory is based on [Goldstein, 1980, 
Chapters 1, 2 and 13], and the calculus of variation used is discussed in detail in Chapter 21. 


22.5.1 Lagrange’s Equation 

If the total force on each particle in a particle system is zero, then the system is in equilibrium , and 

Fi= 0, Vi. (22.148) 

The virtual work of all the forces in a virtual displacement, dr,;, is by definition 

Y J F l -5r i = 0, (22.149) 

i 

because all EjS arc zero. A virtual displacement is an infinitesimal instantaneous Active displacement, 
meaning it is infinitely small, there is no time duration, and the displacement never occurs in the real 
world. Virtual displacements arc a useful mathematical trick, which we will use in the following. The 
only requirement for using virtual displacements is that their use must be consistent with the forces and 
constraints in the system. 

Newton’s second law (22.5) for the i’th particle is given by 


Ft 

= Pi > 

(22.150a) 

4 

Fi - Pi 

= 0. 

(22.150b) 

The total force can be split into two: the applied forces, F\ a \ and the constraint forces, f\ c ' 1 : 

( F (a) +F ( C ) ) _p. = 0. 

(22.151) 

Using a virtual displacement, the work performed is 

E ((*f + ’) 

i 

- Pi) • Sri = 0, 

(22.152) 

and expanding the terms gives 

E V"’ - *•, +E F < c> ■ 

i i 

Sri - 5^Pi • Sri = 0. 

i 

(22.153) 
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Consider a particle constrained to move in a plane. Any constraint force on this particle must be perpen¬ 
dicular to the plane. Since any virtual displacement must be consistent with the constraints in the system, 
they must lie in the plane and thus be orthogonal to the constraint forces. In this case, the virtual work of 
the constraint forces is therefore zero. The discussion in the following will be restricted to systems where 
the net virtual work of the constraint forces is zero. This is D’Alembert’s principle and reduces (22.153) 
to: 

5^ {F ( f - Pi) ' Sr t = 0. (22.154) 

i 

The vectors r, arc not necessarily independent of each other. It is useful to transform them into indepen¬ 
dent coordinates, qj. To see why this is useful, recall from linear algebra that a linear combination of a set 
of independent vectors, v„ can only be zero if all the coefficients, a, , arc zero. 

a\V\ + • • • + a n v n = 0 =>• a\ = • • • = a n = 0. (22.155) 

Hence, given a set of transformed coordinates that are mutually independent and D’Alembert’s principle 
(22.154), we can conclude that F^ a> -p- arc zero for all i. It will be assumed that such a set of independent 
coordinates exists together with a set of transformation equations. The transformed coordinates q, will be 
called the generalized coordinates. 


r\ = ri(qi,q 2 ,.. 

• 5 Qni t)i 

(22.156a) 

r2 = r 2 (qi,q 2 ,.. 

■ i Quit) > 

(22.156b) 

TN = r N (qi,q 2 ,. 

• • i Qnt /)> 

(22.156c) 


where N > n, and k = N — n is the redundant degrees of freedoms removed from the representation. 
Taking the derivative of the f th particle 


d dr, dri 


(22.157) 


reveals a method for converting between the two set of coordinates, which is the f th row of the Jacobian of 
the transformation multiplied by the generalized velocity vector of the system. The virtual displacement 
in the generalized coordinates dr, is naturally a linear combination of the virtual displacements of the 
system Sq^ through the Jacobian of the transformation: 


V dqk 


(22.158) 


The virtual displacements arc instantaneous; therefore, there is no variation in time to take into account. 
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Af- = 0. The first term in (22.154) expressed in generalized coordinates is thus, 




(«• (?N) 

5 ('“Si 9 



(22.159a) 

(22.159b) 

(22.159c) 


implying that 

Y^F[ a) ■6r i = Y^Q j 5q j , (22.160) 

i 3 

where Q 3 is called the generalized force. The second term in (22.154) expressed in generalized coordinates 
is 


'^2p i -Sr i = Y^ mfi ■ Sri 


dr, 




E 


ho 


dqj 
3 J 

.. dri 

rriiTi • ——oqj 
dqj 


\ ( \ ^ .. dr * 


5q.j. 


(22.161a) 

(22.161b) 

(22.161c) 

(22.16 Id) 


The summation in the inner parentheses can be rewritten by using the product rule of differentiation 


E rriifi 
i 


dr.j 

dqj 



dri \ ■ d (dri\\ 

dqj) mri ' dt UJJ 


E 



(rmri 


dr A 

dqj) 


- rriiri ■ 


drA 
dq 3 ) ’ 


dv.i dri 


(22.162a) 

(22.162b) 


Differentiating (22.157) by q 3 gives 
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and inserting into (22.162b) results in 

E .. dri (d ( dvA dv.i 

ReaiTanging terms and substituting back into (22.16Id), it is found that 


i j \ 


d_ d_ 

dt dqj 


X | <% 


We can now substitute (22.160) and (22.165) into (22.154) 




(22.164) 


(22.165) 


(22.166) 


where 




(22.167) 


is the total kinetic energy of the system. Since all 5qj arc independent, the equation can only hold if the 
coefficients arc zero: 


d fdT\ 8T 


dt \dqj J 8q 


-Qj = 0 


(22.168a) 


d fdT\ dT 

dt \ dqj J dqj J ’ 


(22.168b) 


It will be assumed that the forces constitute a gradient field (Definition 18.38), that is, that they may be 
found as a gradient operator on a scalar field, V, as 


F S a) = -ViV, 


(22.169) 


where V (r i, r- 2 ,..., rjy, t) usually denotes the potential energy, and V t is the gradient of V with respect 
to its coordinates corresponding to r l . The generalized forces then become, 


O - V F {a) • — 

i J 

i 

dV dri 

" dri dqj 


(22.170a) 


(22.170b) 


(22.170c) 


(22.170d) 
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Inserting this into (22.168b), it is found that 

d (dT\ d(T-V) 

dt \ dqj ) dqj 

Assuming that V is independent of dqj, which is regularly the case, this is reduced to 

d f d(T-V) \ d{T-V) 

dt \ dqj ) dqj 

The Lagrange’s equation is found by defining the Lagrangian L as 

L = T-V ; 


(22.171) 


(22.172) 


(22.173) 


which results in 


d ( dL\ dL 
dt \ dqj ) dqj 


(22.174) 


for all j. Observe that if some of the applied forces, f,, could not be derived from a scalar potential, then 
Lagrange’s equation becomes 


d { dL\ dL 
dt \dqj ) dqj 


(22.175) 


for all j, where 


Qi = T,f> 


dr t 

dqj 


(22.176) 


22.5.2 Lagrange’s Equations from Hamilton’s Principle 

An alternative derivation of Lagrange’s Equations uses Hamilton’s principle. Hamilton’s principle states 
that the motion of the system from time t\ to time t? is such that the line integral of the Lagrangian, 

rti 

1= L dt, (22.177) 

Jt\ 

has a stationary value for the actual path of the motion. In other words, Lagrange’s equation can be derived 
by setting the variation of the integral, 

81 = 8 f L(q 1 ,...,q n ,q 1 ,...,q n ,t) dt, (22.178) 

Jt\ 

equal to zero, see Chapter 21 for an introduction to calculus of variation. Forces that cannot be written in 
terms of potentials arc ignored in C similarly to (22.175). 

Calculus of variation assumes a continuous system, however, until now the system has been described 
by a discrete set of particles given in generalized coordinates, qi s. Fortunately, an object may be considered 
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as the continuous limit of infinitely many small particles. This is similar to thinking of a real-world object 
as a continuous entity consisting of many, but a finite number of atoms. 

In the continuous limit, the index i should be taken to be a continuous index, a, of the generalized 
coordinates, 

j~a6Sl, (22.179) 

where 17 is the domain (that is, the volume of the object). For simplicity, a single continuous index will 
be used; nevertheless, the mathematical treatment in the following may easily be generalized to three or 
more indices. 

* - {ai,a 2 ,a 3 } . (22.180) 

One way to think of these continuous indices is as a parameterization of the volume of a deformable 
object. 

Example 22.2 

For instance, imagine that the volume is given by the B-spline volume, B(u, v, w), defined as 


B(u,v,w) = YT.Y, N i(u)Nj(v)N k (w)P iikr (22.181) 

i j k 


where N is the B-spline basis function and Pijk tire the control points. Here, u, v, and tu are the B-spline 
volume parameters, and they could be used as continuous indices. Given specific values of u, v, and w 
would uniquely pinpoint a single particle in the material. 


Given continuous indices, the Lagrangian is written as an integral. 


L = 



(22.182) 


where £ is known as the Lagrangian density. The Lagrangian density is a function of time, indices, 
displacements, and derivatives of displacements, 


£ 


£ a,t, r(a, t), 


dr(a,t ) dr(a,t) 


da 


dt 


(22.183) 


where r(a, t ) is the displacement function, also sometimes called the field quantity. For a deformable 
object, the displacement r(a,t ) would be the displacement of the point, a, from its original placement 
(that is, r(a, 0) = 0) in the deformable object. 

Hamilton’s Principle can be stated as 

81 = 6 [ 2 [ C da dt = 0. (22.184) 

Jti J 

Using calculus of variation, the integral equation is converted to the differential equation. 


d ( dC \ d ( dc\ dC 


(22.185) 
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In general, the displacement vector r(o, t) for a system of independent variables a = { a ,} is written in 
condensed form as 


d dC 


dt \d% 


5_C 

dr 


= Q, 


(22.186) 


where 4- is called the variational derivative for a system of independent variables a*, and it is given as 

d d 


5_ _ d_ 
5r dr 


E 


*>< efe' 


(22.187) 


For integral equations involving second-order derivatives, the variational derivative is given as 

d 2 d 


5_ _ d_ 
5r dr 


E d d ^—\ 

daidW + ^ 


ddi 




da.jdcii d^4- 


(22.188) 


22.6 Principle of Virtual Work 

Virtual work is a method of introducing Active constraints on an under-constrained system without adding 
or subtracting energy. It will be shown in this section that the constraint forces can be written as 

f = J T A, (22.189) 

where / is the constraint force, J is the Jacobian matrix, and A is a vector of Lagrange multipliers. From 
the Newton-Euler equations (see Section 7.1 for details) the generalized acceleration vector can be derived 
as 

s = M~ l (/ + /)■ (22.190) 

Here, M is the generalized mass matrix, s is the generalized position vector, and f is the generalized 
applied force vector. 

From a time-independent holonomic constraint, <F(s) = 0, the following kinematic constraint can be 
derived: 


(22.191a) 

(22.191b) 
(22.191c) 

Differentiation w.r.t. time leads to the acceleration constraint 

<E> = js + Js = 0. (22.192) 


d$ 

& = -7T- 8 

ds 


= Js = 0 


© 


© 


© 


© 



“book” — 2005/9/30 — 15:44 — page 689 — #701 


22.1 PHYSICS-BASED ANIMATION 


689 


Here, there arc some subtleties concerning the term j: from the chain rule it can be seen that it involves 
taking a matrix differentiated w.r.t. a vector 


7 _ 5J. 

J - o 

OS 


which would imply a 3D tensor. In practice, this can be avoided by using 


■ 3 $ 

ds 

Insertion of s in the acceleration constraint yields 

* = Ja + JM~ l (/ + /) = 0 , 

and rearranging gives 

JM~ l f = —JM~ 1 f - js. 

The principle of virtual work states that constraint forces do no work, that is, 

f -8 = 0. 


(22.193) 

(22.194) 

(22.195) 

(22.196) 

(22.197) 


However, from the constraint problem it is known that Js = 0, and in combination with the principle of 
virtual work this leads to 

f = J T A. (22.198) 

To get an intuitive grasp of this equation, it may help to think in terms of simple geometry. The rows of 
the Jacobian J arc a collection of vectors, each of which is the gradient of a scalar constraint function. In 
particular, the f th row is the gradient of the Tth entry in <J>. In fact, the gradients arc the normals to the 
isosurfaces given by $ = 0; their directions represent the domains where the solution is not permitted to 
move into. To ensure this, it is required that the change of state s is orthogonal to all those normals. In 
other words, s is only allowed to change in the tangential direction, and this is exactly what the condition 
J.s = 0 means. The set of vectors, /, spanned by J 1 A arc therefore all orthogonal to the state of change 
vector s = 0, and it will therefore always be the case that / • s = 0, which is exactly what the principle of 
virtual work states. 

The Lagrange multipliers may be solved by using / = J T A, implying that 

JM" 1 / = —JM l f - Js , (22.199a) 

JM~ 1 J T \ = -JM~ 1 f-js , (22.199b) 


from which the vector of Lagrange multipliers, and in turn the constraint forces, may be computed. 
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Figure 22.17: A graphical representation of the deformation of an object. 

22.7 Stress and Strain 

In order to model deformable objects, a way to measure deformation is needed. In this section, we will 
define measures for deformation and relate them to stress in a deformable object. The field of continuum 
mechanics has derived a general theory in this respect, which will be described in a condensed form. 

Consider an object in an undeformed state, as seen on the left in Figure 22.17. A point within the 
undeformed object is denoted by the vector a, given in material coordinates. When the object is translated, 
rotated, or deformed, the material at point a changes location. This new location is described by the vector 
x, given in world coordinates. This is illustrated on the right in Figure 22.17. The displacement of the 
material is given as 

u = x — a. (22.200) 

For any point a in the material, there exists a mapping toward its actual position in the deformed 
object, given by x(a). This function is undefined if the vector a is outside the material. Knowing the 
function, one can calculate the change of x with respect to change of a by 

dx = Fda, (22.201) 

where Fij is the deformation gradient (also known as the Jacobian), given by 

Fij = ( 22 . 202 ) 

daj 

which describes the change in world coordinates with respect to change in material coordinates. The 
indexes i and j define a particular value of the vectors x and a, respectively. 

A displacement does not necessarily mean that the object deforms. Rotating and translating the object 
gives rise to a displacement, but not a deformation. The deformation gradient in (22.202) is invariant 
to translation, but not to rotation, since the derivative of vector x changes with respect to a, as seen in 
Figure 22.18; but the magnitude does not change. Therefore, it is possible to describe deformation as the 
change in squared length of dx and da: 

dx 2 — da 2 = dx T dx — da T da. (22.203) 
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Figure 22.18: A rotation changes the derivative of vector x with respect to a; but the magnitude does not 
change. 


Using (22.201), this can be rewritten in terms of material coordinates only, to obtain 

da T F T Fda — da T da = da T (F T F — /) da = da T 2£da. (22.204) 

The symbol £ is Green’s strain tensor also known as the Lagrangian strain tensor , defined as 


S=\{F T F-l). 


(22.205) 


The Green strain tensor is invariant to rigid body motion. Rewriting it in terms of partials, one obtains 

1 / dx dx 


^ 2 \ dat daj ^ lJ ^ ’ 


(22.206) 


where d }J is the delta function. Using (22.200), this can further be rewritten in terms of the displacements, 
by 


£ = 


1 ( 8 (u + a) d(u + a) 


daj 


d aj 


- 6 , 


v 


1 (du du du da da du da da 

+ + + — — S t 


2 \daj daj daj daj daj daj daj daj 
_ 1 f du du duj diij 

2 yda* daj daj daj J 




(22.207a) 

(22.207b) 

(22.207c) 


The Green strain tensor contains a quadratic term, which means that analysis of large deformations be¬ 
comes nonlinear. However, if the displacements arc very small, the quadratic term becomes negligible. 
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This is the assumption for the Cauchy strain tensor, which is a first-order approximation to the Green 
strain tensor, given by 


1 / dui duj 

2 \daj den 


(22.208) 


In addition to the strain at a given point, one can calculate the rate of change by taking the derivative 
with respect to time. For the Green strain tensor, given in (22.206), this becomes 


d ^ d 1 / dx dx \ 1 / dx dx dx dx 

dt y dt 2 \ da n daj > J ) 2 daj da* daj 


From the above, the strain rate V is defined as 


dx dx dx dx 

y da t daj <9a* daj 


(22.209) 


( 22 . 210 ) 


The stress in a deformable body is related to the strain and the strain-rate. This relationship is deter¬ 
mined by the elastic and viscous properties of the material. A way to express this relationship is by the 
2nd Piola-Kirchoff stress tensor S, given by 


S = S (£) +S (V) . 


( 22 . 211 ) 


The elastic stress, S^ £ \ depends on the Green strain tensor, defined in (22.206), and the viscous stress, 
S^' \ depends on the strain-rate, defined in (22.210). These arc given by 




k =1 1=1 


<’ = EE Dij kl Vkl j 


(22.212a) 


(22.212b) 


where the relation between the elastic stress and the strain is given by the stiffness matrix C, and the 
relation between the viscous stress and the strain-rate is given by the damping matrix D. In three di¬ 
mensions, the strain and stress tensors consist of nine values. To fully describe how each of the strain 
values affects the stress, it requires 9x9 = 81 coefficients for both the stiffness matrix and the damping 
matrix. But since £ is symmetric, most of the coefficients of C arc redundant, and can be reduced to 6 x 6 
different values. If the material being considered is isotropic, the material properties arc independent of 
direction. Such materials have only two independent variables (that is, elastic constants) in their stiffness 
and damping matrices, as opposed to the 36 elastic constants in the general anisotropic case. 

For isotropic materials, the elastic stress tensor is given by 


Sly 1 = ]T A SkkSij + 2 pSij, 


(22.213) 
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where // determines the rigidity of the object, and A determines the resistance to a change in volume. A 
similar argument can be done for the viscous stress. Since the strain tensor is symmetric, the strain-rate 
tensor will also be symmetric. The damping matrix D is therefore reduced to the two values (j) and u>, and 
the viscous stress for isotropic materials is calculated by 

3 

jGO _ 


S-j = 4>VkkSij + 2 ipVij. 


(22.214) 


fc=t 


The isotropic elastic stress tensor from (22.213) can be conveniently written on matrix-vector form: 


(22.215) 


using the fact that £ is symmetric. 

The isotropic elastic stress tensor given above uses the Lame coefficients p and A. Often materials are 
specified using Young’s modulus E and Poisson’s ratio u. To convert from one to the other, one can use 
the formulas: 

vE 


Sn 


A + 2p 

A 

A 

0 

0 

o' 


£11 

S-22 


A 

A + 2p 

A 

0 

0 

0 


£22 

5 33 


A 

A 

\ + 2p 

0 

0 

0 


£33 

5*23 


0 

0 

0 

2p 

0 

0 


£23 

5.31 


0 

0 

0 

0 

2p 

0 


£31 

5,2 


0 

0 

0 

0 

0 

2/i 


£12 


A = 


ri = 


(1 + i/)(1-2i/) 
E 


2 (l + i/)' 

Inserting these conversions in (22.215), gives the commonly seen stress-strain relationship: 


(22.216a) 


(22.216b) 


"5n" 


'l - V 

V 

V 

0 

0 

0 


£11 

522 


V 

1 - V 

V 

0 

0 

0 


£22 

5 33 

E 

V 

V 

1 - V 

0 

0 

0 


£33 

5 23 

~~ (1 + v)(l — 2u) 

0 

0 

0 

1 - 2v 

0 

0 


£23 

5 3 i 


0 

0 

0 

0 

1 - 2v 

0 


£31 

5l2 


0 

0 

0 

0 

0 

1 - 2u 


£ 12 . 


(22.217) 


Comparing (22.217) with for example, the one given in [Zienkiewicz et al., 2000, Chapter 6 ], there is a 
factor of \ in the difference on the last three diagonal elements. To explain this, write out the entry €23 in 
the Cauchy strain tensor from (22.208): 


^23 — 


1 (du 2 du 3 


2 V das da 2 


(22.218) 

This is the average of the two shear strains. The total of the two shear strains is sometimes used instead: 

(22.219) 

which gives rise to the difference. This is sometimes called engineering shear strain. 


723 = 


du -2 dus 
das 9a 2 
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23 

Differential Equations and Numerical Inte¬ 
gration 

Differential equations of interest in animation arc often so complicated that it is impossible to find an¬ 
alytical solutions. Therefore, this chapter concentrates on numerical solutions using finite differences 
for explicit and implicit solution methods. Euler [Euler, 1768] was probably the first to use finite dif¬ 
ferences for solving differential equations, while many consider Courant, Friedrichs, and Levy’s paper 
[Courant et al., 1928] as the stalling point of the modern numerical methods for solving partial differen¬ 
tial equations. In this chapter, we will review [Press et al., 1999a, Chapter 16] on ordinary differential 
equations, [Ames, 1969, Collatz, 1986, Morton et al., 1994, Butcher, 2003] on partial differential equa¬ 
tions, [Baraff, 1997a, Baraff, 1997b] on explicit and implicit integration of differential equations, and 
[Sethian, 1999, Osher et al., 2003] on level-set methods. 

A differential equation is an equation where the differential of one or more functions appeal - . They 
are typically classified as either being ordinary or partial. An ordinary differential equation {ODE) is an 
equation of one independent variable x and one dependent variable y(x) and its derivatives. For example, 
F[x, y(x), y'(x), y"(x),... ] = 0, where y^ n \x) = A partial differential equation ( PDE ) is an 

equation where partial derivatives occur, such as F[x, y , z(x, y),z x (x, y),z y (x, y),z xx (x, y),... ] = 0, 
where z(x,y) is the variable depending on the independent variables x and y, and z x (x,y ) = 
etc. The highest order of the derivative occurring in a differential equation is the order of the differential 
equation, and if the differential equation can be written as a polynomial in the dependent variables and 
their derivatives, then the degree of the equation is also called the degree of the differential equation. A 
differential equation of degree one is said to be linear. 

23.1 Ordinary Differential Equations 

Ordinary differential equations may always be reduced to a problem involving only coupled differential 
equations of the form 

dxfft) 

df =X! = Jl(t, Xl, ...X N ), 

dx 2 (t ) . 

—7— = x 2 = h(t,x 1 , ...x N ), 


TT = X N = f N (t,Xi,...X N ), (23.1c) 

dt 


(23.1a) 

(23.1b) 
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Figure 23.1: A function (a) and its corresponding gradient field (b). 


where the expressions /', arc given, together with some boundary conditions on x % . As an example, 
consider a 2D function / : M 2 —> K, and its gradient V/ : R 2 -> l 2 , as shown in Figure 23.1. The 
gradient of / is given by 


V/ 


r _2 li 

dx\ 

df ’ 


(23.2) 


Ldx2 J 


and solutions to x(t) = <f>(x,t) = —V/will be curves [x\{t), X 2 (t)] T , which have —Vf(x,y) as tangent. 
It may be helpful to think of these curves as the paths a ball will follow when it rolls down the hill of /. 

For some ordinary differential equations, the analytical solution is simple to find. As an example, 
consider the ordinary differential equation 


x(t) = <f>(x,t) = —kx(t ). 


By writing the differentiation explicitly. 


x{t) = 


dx 

dt 


—kx, 


and by rearranging the dx and dt, we find 


—dx = —k dt. 
x 


(23.3) 


(23.4) 


(23.5) 


Each side of the equation now denotes a small infinitesimal element in x and t respectively, thus by 
integrating both sides 

J — dx = — J k dt, (23.6) 
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we may find a solution as 


In x = —kt. + c (23.7) 

with some unknown integration constant c. Isolating x, we conclude that the particular - differential equa¬ 
tion has the analytical solution. 


x = e ~ kt+c = e c e~ kt 


= Ce 


—kt 


(23.8) 


and the unknown constant C may be determined if boundary conditions are specified. In spite of the 
apparent magical rearrangement of differential terms and integration w.r.t. different variables on each side 
of the equation, it can readily be demonstrated that the above equation is a solution to (23.3) 


dx dCe kt 
dt dt 


—kCe kt = —kx. 


(23.9) 


Unfortunately, most differential equations do not have analytical solutions. Instead, their solutions must 
be approximated by finite differencing schemes, for example. 


23.1.1 Explicit Euler Integration 

Assume we are given a single first-order ordinary differential equation, r(t) = <p(t,r). We think of 
r = [ay, X‘i ..... .x,y] 7 as a particle moving on a height field, as shown in Figure 23.1(a), or more precisely, 

as traveling in a corresponding gradient field <j), as shown in Figure 23.1(b). We thus wish to find the 

trajectory, r, of the particle under initial conditions, r(0) = r o; r(0) = f*o. 

Examining the Taylor expansion (20.7) for r around f to first order: 

r(t + h) = r(t) + hr(t ) + 0(h 2 ) (23.10a) 

= r(t) + hcj){t) + 0(h 2 ). (23.10b) 

We realize that the function r may be approximated from <f> by iteratively calculating r(t + ih) for i = 
1... n up to the order of precision of h 2 in each step. This procedure is an approximation of 

n ,,T 

r(hT ) = r(0) + h(f>(ih) ~ r(0) + / <f>(t) dt, (23.11) 

i=o Jo 

where T = nh. This is illustrated in Figure 23.2. This is known as explicit Euler integration. Unfor¬ 
tunately, the iterative nature of the approximation implies that the error term accumulates. The solution 
is simple to implement and fast to compute, but neither stable nor very accuratel This is demonstrated 
in Figure 23.3. A vector field is shown in Figure 23.3(a), whose continuous solutions are circles, some 
of which are indicated. Any explicit solution will give spirals, since any finite step will follow a locally 
lineal - approximation of the circular arc, thus leaving the correct solution. This is most prominent near 
the center, where the curvature of the correct solution is highest. In 23.3(b) we show a vector field whose 
continuous solutions are translated exponentials. Taking steps that are too large will make the solution 
diverge, since the locally linear approximation brings the numerical approximation to vectors of gradually 
increasing size, thus making the numerical approximation explode. 
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Figure 23.3: Explicit Euler is inaccurate and can be unstable. Arrows denote the velocity field, thin lines 
arc the continuous solutions, and thick lines arc a numerical approximation. 
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23.1.2 Improving Accuracy with the Runge-Kutta Method 

The accuracy of Euler’s method may be improved by taking into account higher-order terms in the Taylor 
series. The midpoint method (second-order Runge-Kutta method) uses the Taylor-series to second order. 


r(t i+ 1 ) = r(ti) + hr(ti ) + — r(U) + 0(h 3 ). 


(23.12) 


Although we are not given r, we may find a clever approximation as follows: recall that r = 4>{r), and 
therefore 

r = = V0(r(t)) r = V0(r(i)) 4>(r(t)). (23.13) 


dt dr dt kjj "tk k 

Further approximating (f>{r{t)) by first-order Taylor series, 

4>{ri + A r) = 4>{r.i) + V0(rj) A r + 0(Ar 2 ), 
and cleverly choosing A r = |</>(rj), we may write 

<t>{ r i + = 0(n) + V0(ri) ^0(r<) + C>(/i 2 ) 


= ^ + C ) (^ 2 )- 


Thus, by isolating 4^ r we find 


= h<j)(ri + ^<f>(ri)) - h<j>(ri) + 0(h 3 ), 
which may substituted directly into (23.12) as follows: 

h? o 

r i+ i = ri + hr.i + — Ti + e>(/n 

= Ti + h<f>{ri + |^(r**)) + C>(h 3 ). 


(23.14) 


(23.15a) 


(23.15b) 


(23.16) 


(23.17a) 


(23.17b) 


Thus, we conclude that by probing 0 in r* + we have made a more accurate numerical approxi¬ 

mation. 

Further improvement in accuracy along the approach of the midpoint method is very tedious. However, 
a general result exists [Butcher, 1963, Hairer et al., 2002, Butcher, 2003]. 

Definition 23.1 (s-stage Runge-Kutta method) 

Given coefficients b t , aij , E M, i,j = 1... s, and letting Ci = Ylj=i a ij> ^ len an s ~ sta 8 e Runge-Kutta 
method is given as 


(23.18a) 


ki = cf> ^t 0 + ah, r 0 + h CLijkj j , 

S 

ri = r 0 + h^biki. 


(23.18b) 
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The method is of 


order 1 if 


order 2 if the above and 


order 3 if the above and 


I> = i 

i =1 


1 

E 6iC *= 2 

i= 1 



s s 

and EE bjCL'ljCj 

j =1 i =1 


1 

6 ' 


(23.19a) 

(23.19b) 

(23.19c) 


For conditions on higher order are more complicated (see [Hairer et al., 2002, Section III.1] for a deriva¬ 
tion). 


The coefficients are often depicted as 


Cl 

an . 

• • Oj\ s 

C s 

a s i 

.. a ss 


bi ■ 

• • b s 


in which case we have 


explicit Euler : 


implicit Euler : 


trapezoidal integration : 


explicit midpoint : 


4th-order Runge-Kutta integration : 



0 

0 



1 


1 

1 



1 

0 

0 

0 

1 

1 

0 


1 

1 


2 

2 

0 

0 

0 

l 

1 

?, 

0 


0 

1 


0 

0 

0 

0 

0 

1 

2 

1 

2 

0 

0 

0 

1 

2 

0 

1 

2 

0 

0 

1 

0 

0 

1 

0 


1 

I 

1 

1 


6 

3 

3 

6’ 


(23.20) 

(23.21a) 

(23.21b) 

(23.21c) 

(23.21d) 

(23.21e) 


23.1.3 Adaptive Step Size Control 

For many systems, a major reduction in computation time can be obtained by allowing for adaptive step 
size. A simple method for adaptive control of accuracy is step doubling (and the analogous step halving), 
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© 
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where a big step is compared with the equivalent using two smaller steps. For an explicit Euler, two small 
steps of size h arc found to be 


r(t + h) = r(t) + hr(t) + 0(h 2 ), 

(23.22a) 

r(t + 2 h) = r(t + h) + hr(t + h) + 0(h 2 ) 

(23.22b) 

= r(t ) + hr(t) + hr(t + h) + 0(h 2 ). 

(23.22c) 

Conversely, a single step of size 2 h is found to be 

r 2 (t + 2h) = r(t) + 2 hr(t) + 0(h 2 ) } 

(23.23) 


and it is seen that the difference between r(t + 2 h) and r 2 (t + 2 h) depends only on h, and the difference 
between r{t) and hr(t + h), that is, 

Ar = r(t + 2h) — r 2 (t + 2h) (23.24a) 

= h ( r(t + h) — r(t)) + 0(h 2 ). (23.24b) 

Thus, when Ar is large, doubling will imply a large error of approximation, while h may be safely doubled 
for small Ar. 

To conclude, step doubling requires three steps, two of size h and one of size 2 h. The extra computa¬ 
tion time is often outweighed by the possible doubling of h. Step halving is analogous to the step doubling 
discussed above. 


23.1.4 Implicit Integration 


Explicit integration is accurate for small step sizes, h. By step doubling the step size may be adapted to 
reduce the number of steps dynamically. Nevertheless, there exists a class of differential equations, which 
requires extremely small steps to converge, and these arc called stiff ordinary differential equations. As 
an example, consider a particle unsymmetrically attracted toward (0,0). 


f(f) 


d 

x{t) 


7T 

V 

1 

dt 

y(t) 




y{t) / 0. 


(23.25) 


Large c results in unstable or stiff solutions by Euler’s method, which is seen by the first step. 


r(t + h) = r(t) + hr(t) 


'x(t) 

+ h 

' -x{t)~ 


(1 — h)x(t) 

yit). 

-«/(/)_ 


(1 - hc)y(t )_ 


In the limit for large h this solution does not converge. 


lim 

h —>oo 


(1 — h)x(t) 


—oo 

(1 - hc)y(t)_ 


—oo 


(23.26) 


(23.27) 


That is, when |1 — hc\ > 1, then the solution will diverge and thus become unstable. Conversely, the 
solution will converge to 0 when |1 — hc\ <1, which implies that 0 < he < 2 or equivalently h < 2/c, 
as h, c > 0. 
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Implicit integration stabilizes Euler’s method by solving the equation backward, that is, by solving 

r(t + h) = r(t) + hr{t + h). (23.28) 

At first glance, this may appeal - impossible, since it would require a solution to exist in order for us to be 
able to evaluate r{t + h). However, assuming that we are given a vector field 0(r (/)) such that 

r{t + h) = + h)), (23.29) 

then it is possible to solve (23.28) when 0 is linear in r, that is, when 0 may be written as 

0(r + Sr) = 0(r) + V0 Sr, (23.30) 


for some fixed r and variable Sr, and where V0 is the Jacobian of 0 taken in r. Cleverly choosing 
Sr = r(t + h) — r(t ) such that 0(r(f + h )) = 0(r(t) + Sr) = 0(r + Sr) and inserting (23.30) into 
(23.28) we find 

r(t + h) = r(t) + h (0(r) + V0 Sr) , (23.31a) 

r(t + h) — r(t) = h (0(r) + V0 Sr) , (23.31b) 

Sr = h (0(r) + V (j> Sr) . (23.31c) 

We can now isolate Sr as 

Sr — KV(j) Sr = /i0(r), (23.32a) 

(1 - W0) Sr = /i0(r), (23.32b) 

Ql - V0^ Sr = (23.32c) 


where 1 is the identity matrix. The matrix ^ 1 — V0 is often invertible, since its structure often will be 
dominated by 1, in which case the system may be solved for Sr as 


Sr 


-l 


1 — V0 0(r 


(23.33) 


and in which case 

r(t + h) = r(t) + Sr. (23.34) 

In the example of our particle attracted to [0,0] T , we have 0(r) = r = [— x, —cy] T , hence. 


V0 = 


-1 

0 



(23.35) 
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and 


whose inverse may be seen to be 


—1 - - 


1 + 1 


\ + c 


- 


-1 


h 

1—// 
0 


0 

h 


1 -\-hc. 


Thus, the solution for dr is therefore 


Sr 


' ~TTh x ^ ' 

-rfes c 2/(0. ' 


H + 1 0 

-1 

' -x{ty 

[ 0 Ti+4 




(23.36) 


(23.37) 


(23.38a) 

(23.38b) 


In contrast to (23.27), the implicit solution is unconditionally stable and will converge regardless of the 
size of h, since 


lim Sr 

h —>oo 


i 

-+o 

1 

1_ 


7T 

V 

1 

L—cj/W. 




(23.39) 


and thus 


r(t + h) = r(t) + Sr = 0. 


(23.40) 


We conclude that the implicit solution will converge for arbitrarily large h for this example, in contrast to 
the explicit solution, which requires h < 2/c for convergence. 


23.1.5 Semi-Implicit Integration 

If 0 is nonlinear, then a closed-form solution cannot be found. Nevertheless, a linear approximation may 
be used 

<p(r(t + h)) = <j>(r(t)) + V0 Sr + C’dl^r*||~ 0(r(t)) + V0 Sr, (23.41) 

ignoring the higher-order terms. The derivations are identical to those presented in the previous section, 
except they are now approximations. This is known as semi-implicit integration. Hence, it is not guaran¬ 
teed that the solution is stable, but it usually is because the approximation is locally precise. 


23.2 Partial Differential Equations 

Many differential equations in animation contain partial derivatives and are therefore partial differential 
equations (PDE s). We will only treat linear partial differential equations; for nonlinear partial differential 
equations, see [Ames, 1969, Morton et al., 1994], 
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23.2.1 Linear First-Order Partial Differential Equation 

An example of a linear first-order partial differential equation of a function u(x, y) is 


Au x + Bu y = 0, 


(23.42) 


where u x = <>u j^ y ' > and u y = 9u ^ y ^ ■ This equation can be simplified into a ID problem by a change of 
base. Consider the base given by some arbitrary parameters a, b, c, and d: 


a = ax + by and /3 = cx + dy. 


The terms in (23.42) may be rewritten in the new base as 


u x = 

u y = 


du 

dx 

du 

dy 


du da du d/3 

da dx d/3 dx 

du da du d/3 

da dy d/3 dy 


u a a + upc 


u a b + upd. 


(23.43) 


(23.44a) 

(23.44b) 


Inserting (23.44) into (23.42) we find 


0 = A(u a a + upc) + B{u a b + upd ) = u a (Aa + Bb ) + up(Ac + Bd). (23.45) 

Remember that the coordinate shift was arbitrary and does not influence the solution. Hence, we may 
choose the coordinate system, for which the solution is most easy to find. Consider a system where 
Ac + Bd = 0, for example c = B and d = —A. Assuming that .4« + Bb / 0, we may divide both sides 
of the equation by Aa + Bb resulting in 

u a = 0. (23.46) 

Hence, the solution does not change in the a direction and is therefore independent on a. On the other 
hand, there are no bindings in the equation on the solution in the f3 direction; it can therefore be any 
differential function in (3. Hence, the general solution is 


u(x, y) = f((3) = f{cx + dy) = f(Bx - Ay) (23.47) 


where / is an arbitrary differentiable function. A single first-order partial differential equation behaves 
similarly to a hyperbolic second-order differential equation. Numerical solutions will be discussed in 
Section 23.2.5. 

Systems of two first-order partial differential equations may be classified according to the flow of 
information. Consider two 2D functions u(x, y) and v(x. y) and the following system of equations: 


a\u x + b\u y + c\v x + d\v y = f\ (23.48a) 

a 2 u x + b 2 u y + c 2 v x + d 2 v y = f 2 . (23.48b) 

We will now investigate conditions for unique solutions to the partial derivatives. Assume that the func¬ 
tions u and v have been solved for up to some curve T, as illustrated in Figure 23.4. We now wish to 
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Figure 23.4: An assumed state in the process of solving a differential equation. The solution is assumed 
found up to the curve F. 

investigate under what conditions the partial derivatives arc uniquely determined at point P on F using 
values of u and v on T. 

The tangent vector to T at P defines a local linear fit to the curve, and assuming that it is nonvertical 
we may write it as 

n(x) = ax + f3, (23.49) 

and on the curve we may write our 2D functions as u(x, //()) and v(x. //(.i;)). Hence, the partial deriva¬ 
tives of u and v along the curve arc 


du 

dx 

dv 

dx 


— tt x T Hy , i 


= V X +Vy 


dji 

dx' 

dfi 

dx 


(23.50a) 

(23.50b) 


Since u and v arc known on the curve, their derivatives in the tangent direction arc also known. Combining 
(23.48) and (23.50a) results in a system of equations: 


at 

<22 

1 

0 


bi 

Cl 

di 


Ux 


7f 

b 2 

c 2 

d 2 


ILy 


h 

d[A 

dx 

0 

0 


v x 


du 

dx 

0 

1 

dfi 

dx- 


_Vy_ 


dv 

-dx- 


(23.51) 


where all coefficients arc known except the partial derivatives of u and v. The system has a unique solution 
to u x , u y , v x , and v y , when the determinant of the system matrix is nonzero. When the determinant is 
zero, that is. 


(b 2 d\ - b\d 2 ) + (—& 2 ct + bic 2 - a 2 d\ + a\d 2 ) ^ + (a 2 ci - aic 2 ) 



(23.52) 
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the system does not have a unique solution. The above equation is called the characteristic equation. 

Slopes © of the curve T at P, which are solutions to the characteristic equation, arc called the char¬ 
acteristic directions. The differential equation may be classified according to the discriminant of the 
characteristic equation. 


{—b 2 c\ + b\c 2 - a 2 d\ + a\d 2 ) 2 - 4 (a 2 ci - aic 2 ) (b 2 di - b\d 2 ) , (23.53) 

and the characteristic directions may be real and distinct, real and identical, or imaginary, when the dis¬ 
criminant is positive, zero, or negative, respectively. This induces a classification of the system of linear 
first-order differential equations (23.48), which arc called parabolic for zero discriminant, elliptic for 
negative discriminant, and hyperbolic for positive discriminant. These classes arc in a physical setting 
typically characterized as 

Parabolic 

A time-dependent, dissipative process that evolves toward a steady state. 

Elliptic 

A time-independent process, and thus is in its steady state. 

Hyperbolic 

A time-dependent process that does not evolve toward a steady state. 

The characteristic directions arc important for the numerical schemes used to produce stable implementa¬ 
tions of the partial differential equations to be discussed next. 

23.2.2 Linear Second-Order Differential Equations 

For a 2D function u : M 2 —r M, the simplest second-order differential equations arc 

u xy = 0, (23.54) 

since it is readily verified that 

u(x, y) = f(x) + g(y), (23.55) 

is a solution whenever / and g are some arbitrary ID differential functions. The functions u, which 
fulfill (23.54), are conceptually produced by translation. Consider a fixed function g(y) = ?/((), y). The 
remainder of u is produced by translating g along x using f(x) as its offset. 

The wave equation, 

C Uxx tlyy — 0 (23.56) 

for some constant k / 0, is similar to (23.54) in that the solution may be written as two independent 
differential functions along two directions. Consider the same change in base as (23.43): 

a = ax + by and (3 = cx + dy. (23.57) 
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Continuing from (23.44) it is found that 

du x du x da du x df3 

V'xx — q — — -Qql ~Qx ^ d/3 ~Qx — 4“ 4“ (tt a/. 3® 4“ UppCJC, 

du v du v da du v d(3 . , , 

= — = + = ( u aa b + rx a/3 d)& + (u aj3 b + uppd)d. 

Inserting the above equations into (23.56) and rearranging terms results in: 

0 = K 2 (u aQ a 2 + 2 u a gac + UggC 2 ) - ( u aa b 2 + 2 U Q gbd + Uggd 2 ), 
= u aa (n 2 a 2 — b 2 ) + 2 u a g{n 2 ac — bd) + ugg{n 2 c 2 — d 2 ). 

We now choose a coordinate systems such that 

K 2 a 2 — b 2 = 0, 
k 2 c 2 — d 2 = 0, 

K 2 ac — bd / 0. 

For example, b = na and d = —kc, which implies that 

u a g 0 , 


(23.58a) 

(23.58b) 


(23.59) 


(23.60a) 

(23.60b) 

(23.60c) 


(23.61) 


and that the solution is: 

u = /(ct) 4- g(/3) = f(ax + by) + g(cx + dy) = f(ax + nay) + g(cx — ncy). (23.62) 

The constants a and c may, without loss of generality, be chosen to be a = c = 1. The solution is the sum 
of two translated functions. The first term, f(x + ny), is produced as a linear translation with slope k of a 
mother function at y = 0. Likewise, g(x — ny) is a translation with slope — n of another mother function 
at y = 0. Hence, assuming that k is positive, then as y is increased, the / function is translated toward 
smaller x values and g is translated toward larger x values, both with velocity k. 

Any quasi-linear second-order partial differential equations may be written on the form 

-4 11j-j- T Bu X y + Cuyy T F[u x , u.)j , f ) — 0, (23.63) 

where F(u x , u y , b ) is any function of the first-order partial derivatives and a function / not depending on 
u. The equation is quasi-linear since F may be nonlinear. Analogous to the first-order system, we may 
classify (23.63) according to its characteristic equation. To derive the characteristic equation, consider an 
equation solved on half of its domain up to the curve T, as illustrated in Figure 23.4. The curve F may be 
approximated locally at P along the tangent as the line 

/i(x) = ax + (3. (23.64) 
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Classification 

Condition 

Characteristic Directions 

Parabolic 

B 2 - AAC = 0 

1 

Elliptic 

B 2 - 4 AC < 0 

0 

Hyperbolic 

B 2 -4AC> 0 

2 


Table 23.1: The classification of a general linear second-order partial differential equation (23.63) accord¬ 
ing to its characteristic equation. 


The derivatives u xx , u xy , and u yy arc known along T, and we wish to find slopes of T for which the 
derivatives cannot be determined across T. On T we can write u x {x, //(.x)) and ii y (x, / /(a;)), and since we 
know u on T, then we can calculate the derivatives as 


du x dfi 

dx dx 

du v d/x 

dx dx 

Writing the system of equations as 


'A 

B 

C' 


v*XX 


' 0 ' 

1 

d/i 

dx 

0 


'U'xy 

= 

du x 

dx 

_0 

l 

dfi 

dx- 


Uyy_ 


dlLy 

L dx -1 


(23.65a) 

(23.65b) 


(23.66) 


The derivatives u xx , u xy , and u yy are undetermined if the system matrix is singular, that is, 


A 



-B d A + C = 0 . 

dx 


(23.67) 


which is a second-order equation in '-jg. That is, the number of characteristic directions '-jg are determined 
by the discriminant 

B 2 - 4 AC, (23.68) 

and the second-order linear partial differential equations (23.63) arc classified according to Table 23.1. 

The second-order partial differential equation (23.63) may alternatively be transformed into a system 
of first-order linear equations, for example, by using v = u x and w = u y , resulting in the system of 
equations: 


0 = Av x + Bv y + 0w x + Cw y , 

0 = 0v x + 1 v y — 1 w x + 0 w y . (23.69) 

The two equations arc needed since we introduce an extra variable and hence need to fix the added degree 
of freedom. The first of the equations above can be seen to be (23.63) in disguise, while the second 
couples v and w by requiring that v y = w x <=> u xy = u yx . This system of equations can easily be shown 
to have the same characteristic equation, and this system of first-order equations will therefore have the 
same characteristic directions as the original second-order equation. 
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23.2.3 Parabolic Equations 

In the following, we will discuss convergence and stability for the parabolic equations. For a parabolic 
second-order differential equation. 


Au xx + Bu x y + CUyy — 0, (23.70) 

with B 2 — 4 AC = 0, there is one characteristic direction given by © — 0, which implies that // is 
constant. The prototypical example of a parabolic differential equation is the heat diffusion equation. 


du d 2 u 
dt dx 2 ’ 


(23.71) 


where t is time and x is the spatial dimension for a ID signal u(x, t). For the heat diffusion equation 
A = 1 and B = C = 0, and the discriminant (23.68) is therefore zero. 

Although several methods exist for solving the heat diffusion equation, we solve it with finite differ¬ 
ences on a grid u(iAx,jAt) — Cj.-j, where U is an approximation of u. We further impose boundary 
conditions U Xj o = u(x,0) = b\(x), ZTq,* = u(0,t) = 62 (f), and f/pt = u(l,t) = 63 (f). 


23.2.3.1 Stability and Convergence for Explicit Solutions. 

An explicit solution of the heat diffusion equation is found using a forward finite difference operator for 
the time derivative and a central difference operator for the spatial derivative: 


1 U hJ _ ('7-f -1 .j -f- Ui-ij 

At Ax 2 

Rewriting suggests the following update rule: 


(23.72) 


U i: j +1 = mU i+ ij + (1 - 2 m)Uij + mUi-ij, 


(23.73) 


where m = This is an explicit solution in j, since the solution at time j + 1 is explicitly given as a 
function of prior values of the solution at times j and earlier. The elements on the grid taking paid in the 
computation is often illustrated as a computational molecule, as shown in Figure 23.5. 

The truncation error, T, of the approximation is found by replacing the approximation U t j with the 
true solution, u(x,t) = u(iAx,jAt), that is, 


u(x, t + Af) — u(x, f) u(x + Ax, f) — 2 u(x, f) + u(x — Ax, f) 


Af 


Ax 2 


+ T(x, f) 


(23.74) 


and then inserting the Taylor series for u, 


E 

71—1 


(Af )"" 1 d n 


n\ 


dt' 


-u(x,t) = J2 


(Ax 


\n—2 Qn 


71—1 


n\ 


dx' 


-u\x 




(—Ax) 


71—2 Qn 


71—1 


n\ 


dx' 


-u(x,t) + T(x,f). (23.75) 
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Figure 23.5: The computational molecule for an explicit finite differencing scheme for the heat diffusion 
equation. 


Isolating T gives 


n*, o = t 0 -1 «>, 


whose leading terms are 

du(x,t) Atd 2 u(x,t) d 2 u(x,t) (Ax) 2 d A u(x,t) 2 a 4 a 

r <* *> = - V + “2 - -vr- -& 1 +° (At - + A * > 


A td 2 u(x,t) A x 2 d 4 u(x,t) 

2 dt 2 12 <9x 4 


+ 0(At 2 + Ax 4 ). 


(23.76) 

(23.77a) 

(23.77b) 


The simplification above was obtained by applying the heat diffusion equation, u / = u xx . From this we 
observe that the solution has first-order accuracy in At, but if Ax 2 = 6At, then the method has second- 
order accuracy in At. Furthermore, using the mean value theorem. Theorem 20.1, we realize that there 
are two constants xq and to in the open intervals xq € (x — Ax, x + Ax) and to € ft, t + At), where 


_Atd 2 u(x,to) Ax 2 d A u(xo,t) 

~ ~2 dt 2 12 dx 1 

At fd 2 u{x,to) 1 d A u(xo,t)\ 

2 \ dt 2 6 m dx A ) ’ 


(23.78a) 

(23.78b) 


where rn = ^4. Hence, if we assume that uu and u xxxx are bounded, then the truncation error is T —> 0 
as At, Ax —> 0, and we say that the method is unconditionally consistent. 

The convergence of a finite difference method is typically studied in two manners—either using Taylor 
series or Fourier series. These will be described below. For Taylor convergence, we write the error as 


Eij = Utj - u(iAx,jAj). 


(23.79) 
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The approximation U satisfies (23.73) exactly, while u differs by the truncation error, that is, 

Ui t j +1 — u(x,t + A j) = — u(x + Ax,/)) + (1 — 2 m)(Uij — u(x,t)) 

+ m(Ui -ij — u(x — Ax, /)) + A tT(x, /) (23.80a) 

Eij + 1 = mEi + ij + (1 — 2 m)Ei t j + + AtT(x, /). (23.80b) 

The coefficients in front of E arc positive as long as 

m > 0 and 1 — 2m. > 0, (23.81a) 

0 <m < (23.81b) 

and in which case 

|-E)j+i| < + (1 - 2m)\E i: j\ + m\Ei_ij\ + A/|T(x,/)|. (23.82) 

Since the sum of the coefficients on the right-hand side is always 1, That is, m + (1 — 2m) + m = 1, then 
the sum of the error terms on the right-hand side will never exceed the maximum value, that is, 

\Ei : j+ 1 | < max(\E i+ ij\, \E itj \, \E i _ 1 j\) + A/|T(x,/)|. (23.83) 

Finally, since the approximation agrees with the analytical solution on the boundary, (7,.o = u(iAx, 0), 
then the error must be zero on the boundary, 


E ifl — 0, 


(23.84) 


and by induction we find that 


Eij+i\ < jAt\T(x,t)\. 


(23.85) 


We conclude that the error E —► 0 as At —► 0, when 0 < m < and hence the numerical solution 
approximation converges to the analytical solution. 

In Fourier convergence, the error is expressed as a Fourier series and the stability is studied as limits 
on the Fourier coefficients. Let’s start by using the method of splitting, where it is assumed that u is 
separable, that is, 

u(x,t) = f{x)g{t). (23.86) 

The heat diffusion equation (23.71) then becomes 


f{x)g'{t ) 

a'(t ) 

9(t) 


f\x)g(t) 

/"(*) 
/(*) ' 


(23.87a) 

(23.87b) 
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Since both sides of the equation arc independent, they must equal a constant, 

g'{t) /"(*) 

git) fix) c ’ 

and we find that 

g(t) = exp(cf), 
fix) = sin(x\/—c), 


(23.88) 


(23.89a) 

(23.89b) 


where we further have to require that c < 0 for g to be bounded and for / to be real. Thus writing 
c = — k 2 , we find 

u(x,t ) = exp(— k 2 t) sin(kx), (23.90) 

and since the solution is independent on k, then any linear combination in k will also be a solution, that 

is, 

OO 

u(x,t)= a/- exp(— k 2 t) sin{kx). (23.91) 

k=— oo 

At the boundary, we have 

OO 

u (x, 0) = £ afcsin(fex), (23.92) 

k=— oo 

and we realize that this is a Fourier series with a/, as the Fourier coefficients. 

In the finite difference scheme (23.73) we’ll assume that the complex exponential is a solution, that is, 


Ui,j = exp(— kiAxy/— 1) (23.93a) 

= AC/ij-i, (23.93b) 

for some constant A, and where we write y/—l explicitly to avoid confusion with the index i, and where 
A exp(0 \f—l) is the complex exponential, 

A exp(0\/—1) = Acos(6l) + A\/—1 sin(0). (23.94) 


Using the recursive form of our guess (23.93) in (23.73) and dividing with a factor, U r] we find 


A Ui 




U 


= rn 


Ui+i,j i /, 0 . Ui—l,j 

+ (1 — 2m)-—-- + 7Ti- 


h3 


Ut 


hJ 


Ui 


h3 


u, 


h3 


4 


A = m 


Ui+K 
Ui 


+ 1 — 2 in + m- 


U^ 






Ui 


(23.95a) 


(23.95b) 


h3 
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and when we insert (23.94), we find that 


= mexp (—k(i + l)Ax\/—1 + kiAxy/— 1) + 1 — 2m 


+ mexp (—k(i — l)Axy/—l + kiAxy/^\) 

(23.96a) 

= 1 — 2m + m exp (— kAxy/— l) + mexp (kAxy/— l) 

(23.96b) 

= 1 — 2m + 2mcos(kAx ) 

(23.96c) 

= 1 — 2m (1 — cos(fcAx)). 

(23.96d) 


Thus, we conclude that (23.93) is a solution, when A = 1 — 2m (1 — cos(fcAx)) for some k. Since 
the solution does not depend on k, we further conclude that any linear combination of (23.93) is also a 
solution, that is, 

OO 

Ui,i= £ afc \ J k exp(— kiAxy/— 1)- (23.97) 

k =—oo 

Again, we find a Fourier series and find that the coefficients a\~ arc determined by the boundary function 
Uifi. In the continuous solution (23.91), the high frequencies were exponentially damped. In the numerical 
scheme (23.97), the solution is damped by Al, and thus requires that A/.| < 1 for all k will ensure that the 
solution does not diverge as j —» oo. The function |A&| takes its maximum value when k = for any 
integer n, in which case 

1 > A*, > 1 - 4m. (23.98) 

Since A*. < 1, we must require that 

|1 — 4m| < 1, (23.99) 

and for m > 0 this implies that m < Hence, again we find that when 0 < m < -j, then the high 
frequencies in the solutions will not be amplified and the solution will converge. 

Stability conditions for explicit equations typically implies very small iteration steps in the solution. 
A qualitative argument for this is that the characteristic direction of the heat diffusion equation is t = 
constant, implying that the flow of information from the boundary conditions is along t = constant. This 
contradicts the numerical implementation in (23.73), in which the value at (i, j+ 1) is determined by values 
at (i — 1 , j), (■ i,j ), and (i + I. j), implying characteristic directions © = ±^, and the characteristic 
directions only approaches the continuous directions when At —> 0 for some fixed Ax. 


23.2.3.2 Stability and Convergence for Implicit Solutions. 

The heat diffusion equation may be approximated by an implicit method, 

Ui ’ j+ At ^ + ° {At) = Ui+W - 2 ^ 2 +1 + Ui ~ 1J+1 + 0(Ax 2 ), 


(23.100) 


and comparing with the explicit solution (23.72), the spatial derivative u xx has been approximated as a 
central second-order finite difference approximation at time j +1 rather than at time j. The computational 
molecule for this implicit approximation is shown in Figure 23.6. This suggests an update rule as 
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t = jAt 



Figure 23.6: The computational molecule for an implicit finite differencing scheme for the heat diffusion 
equation. 


-mU i+ ij+i + (1 + 2m)Uij + i - mUi- ij+i = Uij, (23.101) 

where m = This is a linear system of equations in Uj j, and the solution for j + 1 is undetermined 
without the boundary conditions at j + 1 in agreement with the continuous solution. As an example, set 

u(x, 0) = 1, when 0 < x < 1, it(0, t) = 0, u( 1, t) = 0, Ax = At = -j, and i, j = 0,1,_The system 

of equations is then 


1 

0 

0 

0 

0 

0 ' 


1 


1 

—m 

1 + 2m 

—m 

0 

0 

0 


Uij+i 


tflj 

0 

—m 

1 + 2m 

—m 

0 

0 


U 2 ,j+i 



0 

0 

—m 

1 + 2m 

—m 

0 


U 3,j +1 



0 

0 

0 

—m 

1 + 2m 

—m 


U 4,j+l 


Uaj 

0 

0 

0 

0 

0 

1 


1 


1 


(23.102) 


using m = and starting with o = 1. The determinant of the system matrix is 1 + 8m + 21m 2 + 
20m 3 + 5m 4 . In general, it can be proven by induction that the determinant is 


det A n = (1 + 2m) det A n _i — m 2 det A n - 2 > 


(23.103) 


for n > 4 for systems on the form 

-4.3 

A 4 


1 0 

—m 1 + 2m 
0 0 

1 0 

—m 1 + 2m 

0 — m 

0 0 


0 

—m , 

1 

0 0 ' 

—m 0 

1 + 2m —m 
0 1 


(23.104a) 


(23.104b) 
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Since m > 0, it can be seen that det A n > 1, and that the system matrix always has an inverse. The most 
common solution is to decompose the system matrix into an LU decomposition (see Section 19.2). 

The convergence of this system is most easily studied using Fourier convergence. Similarly, to the 
explicit method we guess that the complex exponential 


Uij = A J exp(— kiAxV— 1) (23.105a) 

= \Uij-i, (23.105b) 

is a solution to the finite difference scheme. Inserting the recursive formulation and dividing by Uij, we 
find 


—m 


A(/j+l j ! ! r)™^ ^Ui,j 3.f 1 j U/ 


Ui 


+ (1 + 2m) J — m 


1,3 


1,3 


u, 


1,3 


Ui 


Ul 


A [ 1 + 2m — m 


( Ui-\-\j Uj— 


V Uij u hJ 


1,3 1,3 

4 


, 1 


and when we expand the complex exponential, we find that 


A(1 + 2m — m (exp(— kAx\/— 1) + exp(+kAx\/— 1))) = 1 

4 


1 + 2m (1 — cos(fcAx)). 


(23.106a) 


(23.106b) 


(23.107a) 

(23.107b) 


Thus, we conclude that (23.105) is a solution, when A = (1 + 2m (1 — cos(kAx))) \ and since this is 
independent of k, we further conclude that any linear combination of (23.93) is also a solution, that is, 


OO 

Uij = E ak\ J k exp(— kiAxV~ 1)- (23.108) 

k=— oo 


Again, we find a Fourier series and find that the coefficients, a /arc determined by the boundary function 
Uifi. In contrast to the explicit approximation, the implicit approximation is unconditionally stable, since 

A < 1. (23.109) 


23.2.4 Elliptic Equations 

In the following, we will discuss convergence and stability for the elliptic equations. For an elliptic 
differential equation, 

Allxx + Uu x y T Cllyy — 0, 

with B 2 — 4,4(7 < 0, there arc no real characteristic directions. 


(23.110) 
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The prototypical example of an elliptic differential equations is the Laplace equation : 


d 2 u d 2 u 

a^ + w +fix ' s) = 


This equation is quite similar to the 2D version of the heat diffusion equation (23.71), 


du d 2 u d 2 u 
dt dx 2 dy 2 ’ 


(23.111) 


(23.112) 


and the elliptical equation may be solved as the steady state of this. We will assume that the solution 
has been obtained somehow, and go on to analyze the truncation and convergence of the following finite 
differencing scheme: assuming a grid U^j — u(iAx,jAy) and using central differencing, we find that 


2C/jj T Ui—i j | Uij-y .i 2Uij + U^ j —i ^ n 
+ Ay2 + Kj - U - 


For short, we will write this on operator form 

L[U] + / = 0. 

The truncation error is found by inserting the true solution u in place of U, 

u(x + Ax, y) — 2 u(x, y) + u(x — Ax, y) 

Ax 2 

+ «*'V + *y)-M£v) + *x,v-*v) + /(a . s) = T(X: A 

and represent u by its Taylor series, 

^ (Ax) n ~ 2 + (-Ax)"” 2 d n . . 

> -:-—— u(x,y) 

t—* n\ dx n 

n=l 

, ^ (Ay)"’ 2 + (-Ay)"' 2 d n , , , „ , w , 

+ 2^-—- g- r r u ( x ’ y) + /( x > y ) = T ( x ’ v>- 

n= 1 n ' y 

Identifying leading terms, 

5 2 «9 2 

^(x,y) = / (x, y) + ^ u(x,y ) + -^u(x,y) 

At 2 /l 4 A 7/ 2 /9 4 

+ -wb u{x ’ y) + y)+ ° (ax4 +Ay4) 

At 2 Ft 4 An 2 F) 4 

= y) + ll*) + °( Ax4 + 


(23.113) 


(23.114) 


(23.115) 


(23.116) 


(23.117a) 


(23.117b) 
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where we have used (23.111) in the last simplification above. As discussed in Chapter 20, this will be 
limited by some constants M x 4 and M y t as 


\ A 

T(x,y) < 

= M 


(23.118a) 

(23.118b) 


The convergence of (23.113) is found by studying the error of the approximation as a function of Ax and 
Ay. The error is defined as 

&i,j = U id - u(iAx,jAy). (23.119) 

A bound on e may be obtained with the help of a comparison function. 


^= (x - iAx - - + y - jAy - - , 


(23.120) 


for which the second-order structure is simple. Now we produce a linear combination of (23.119), 
(23.120), and (23.118b), as 

1 


V’iJ — &i,j + 

Applying the finite difference scheme (23.113) to both sides of this equation gives 


(23.121) 


L[ip\i,j — L[e]ij + —ML[\i]ij 


(23.122) 


The first term is evaluated as 


L[e]ij = L[U]ij - L[u](iAx,jAy) 
= -T (iAx, jAy), 


(23.123a) 

(23.123b) 


since L[U] = —f according to (23.113) and L[u\(x,y) = T(x,y) — f(x,y) according to (23.115). The 
second term is evaluated to 

^ML[^]ij = M, (23.124) 

since is a quadratic function that does not contain higher-order terms. Hence, we conclude that 


L[ip]i,j = ~T(iAx, jAy) + M > 0, 


(23.125) 


since M is the maximum bound on T. To get a bound on T, we will use the maximum principle. 
The maximum principle is derived as follows: assume that an operator L is given on the form 

— 'y \ aUij, 

k£j\f 


(23.126) 
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where k is some set of neighboring positions AT, and where all the coefficients are positive, i.e., a /,. > 0 
and a > 0. Furthermore, assume that the coefficients arc given such that 


CL 7 ^ 
k£j\f 


(23.127) 


and that 


L[U]ij > 0 


(23.128) 


for all coordinates (i,j) in the interior of the domain, then cannot be greater than the maximum value 
at the boundary of the domain. This will be proven by contradiction: assume that there was a maximum 
Um > 0 in the interior of the domain, then L[U] > 0 =7 U ltJ < a kUk/a < a kPmt a < 
U rn . Hence, all values (//,. must also be equal to Um- Applying this argument to the neighboring values 
iteratively until we reach the maximum at the boundary, we must conclude that the maximum at the 
boundary is equal to the maximum in the interior, but this is a contradiction. 

Hence, for (23.113), we have that the coefficients arc positive in the sense of (23.126) and that (23.127) 
is fulfilled, since a = Ylk a k = ~xT ? + Hence, we can conclude that cannot be greater than its 
neighbors due to (23.125). Thus, the maximal value must be located on the boundary of the domain, but 
here e t>J = 0, hence, 

1 M 

ma xipij = max—MT'jj = ——. (23.129) 


Furthermore, since T is nonnegative, we conclude that 


, 1 , M Ax 2 Ay 2 

ei j = ipi h —-M$j j < ibi j < — =- M r 4 H- M.a. 

h3 J 4 r ’ J ~ 8 96 96 y 


Similar analysis for —e% j will show that 


Ky| = \U itj - u(iAx,jAy)\ < 


(23.130) 


(23.131) 


To conclude, the finite difference scheme (23.113) will converge to the continuous solution as {Ax, Ay) 
0. 


23.2.5 Hyperbolic Equations 

In the following, we will discuss convergence and stability for the hyperbolic equations. First, we will dis¬ 
cuss characteristic directions for hyperbolic second-order equations, and then we will study the numerics 
of the more common hyperbolic first-order equation. 

For a hyperbolic second-order differential equation. 


-Auxx 4“ Dllxy T CUyy — 0, 


(23.132) 
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with B 2 — A AC > 0, there arc two characteristic directions given by ^ = B± ^ 2 A 4AC ■ These 

directions arc perpendicular when 


1 


1 

-B+y/B 2 -AAC 


-B-VB 2 -4AC 

L 2 A J 


L 2A J 


that is, when A = —C. As an example, consider the Wave equation , 

tttt V'XX — 0 ) • 


(23.133) 


(23.134) 


Seeing that A = —C = 1 and B = 0, we realize that the characteristic directions are perpendicular. The 
slope of the characteristic direction may be found to be ±1 by evaluating The characteristic directions 
are special, since they simplify the partial differential equation. That is, when transforming the equation 
to a coordinate system aligned with the characteristic directions. 


a = x + t, (23.135a) 

(3 = x — t. (23.135b) 

we may write u(x, t) = v(a, ft), and we may substitute v for u in (23.134). Evaluating the second-order 
derivatives of v w.r.t. x and t we find 


Vtt — v aa 2v a p T v (jfj, 
Vxx — v na + 2v a q H~ Vfis ? 

and we therefore conclude that (23.134) is equivalent to 


(23.136a) 

(23.136b) 


V a /3 = 0. 

It is readily checked that solutions to this equation are specified by 

v{a,ft) = f(a) + g(ft), 


(23.137) 

(23.138) 


or equivalently 


u(x, t ) = f(x + t) + g(x - t), 


(23.139) 


for any twice differentiable functions / and g. In this example, we see that the hyperbolic equation 
has a limited interval of dependence, as shown in Figure 23.7. More rigorously, if we fix the boundary 
conditions u(x, 0) = F{x) and uftx, 0) = F t (x), then we find 


u(x, 0) = f(x) + g(x) = F(x), 
ut(x, 0) = f'(x) - g'(x) = F t (x). 

Differentiating (23.140a), we find two equations in f and g' as 

f'{x) + g'(x) = F\x), 
f\x)-g\x) = F t (x), 


(23.140a) 

(23.140b) 


(23.141a) 

(23.141b) 
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t = jAt 



Figure 23.7: The interval of dependence for the wave equation at the point (x, t ) in the solution is [x — 

t, x + 1]. 


and solving for f and g', we have 


/'(*) = \ i F \ x ) + F t(x)) , 

9 '(x) = ^ (F\x) - F t (x)) . 


Integrating both equations gives 


f(x) = £ f\x) dx = ^ £(x) + £ F t (0 dtj , 
g(x) = g'[x) dx = ^ £(x) - F t +) d£ , 

and thus, 


u(x,t ) 


f(x + t) + g(x — t) 

1 ( r x+t 

2{ F{x+t)+ L 

^ £(x + t) + F(x - 
^ + t) + F(x - 


F t (o <%) +\ £(x -*)-£' F t(o £ 

rx+t r0 \ 

-t)+ F t (Z) d£+ F t (Z)dZ) 

JO Jx—t / 

rx+t \ 

-t) + J f F t (Z)dZj. 


(23.142a) 

(23.142b) 

(23.143a) 

(23.143b) 

(23.144a) 

(23.144b) 

(23.144c) 

(23.144d) 


Thus, the values of u at x. t only depend on the values of F at x + t, and x — t and on F) in the interval 
[x — t,x +1\. 
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The prototypical example of a hyperbolic differential equation is the linear advection equation , 


du du 
dt a dx' 


(23.145) 


The solution to the linear advection equation is given by (23.47), as 


u(x, t ) = f(x + at), 


(23.146) 


for any differential function /. Comparing with the second-order equation (23.144d), we find that they arc 
similar in the sense that the flow of information is along lines, but in contrast to the second-order equation, 
the solution to the advection equation depends only on a single point on the boundary u(x. 0) = f(x). 

Hyperbolic differential equations in several dimensions arc rather complex to solve numerically, and 
we will limit ourselves to the discussion of the simplest case of the linear advection equation (23.145), 
which is important for the much used Navier-Stokes equation discussed in Section 11.2. In the following, 
we will approximate the advection equation on a grid u(iAx,jAt) — (/,, ? , where U is an approximation 
of u. We further impose boundary conditions U Xt o = u(x. 0) = b\ (x). 


23.2.5.1 Stability and Convergence of the Upwind Schemes 

The lineal - advection equation (23.145) is straightforwardly approximated by a forward finite difference in 
time and either a forward or backward finite difference in space, depending on the characteristic direction. 
We will assume that U r ,j > 0, hence, 


U, 


i,j +1 


-Ui 




At 


= —a 


Utj - Ut _ 


1 ,3 


Ax 


(23.147) 


The truncation error is found in the standard way by replacing U with u represented by its Taylor series, 


u(iAx, (j + 1)A t) - u(iAx,jAt) u(iAx,jAt) - u((i - l)Ax,jAt) 
1 [x, t) = -—-1- a- 


At 


Ax 


E 


At 


n— 1 Qn 


n— 1 

At d 2 


n\ 8t n 


u 


M + af 


n= 1 

—Ax d 2 


ni 


= - W2 u(x,t) + a 2 &2 


dx n 

2 , a„2x 


u(x,t ) + 0(At z + Ax z ) 


(23.148a) 

(23.148b) 

(23.148c) 


where we have used (23.145) in the last simplification. Thus, T —> 0 linearly as (At, Ax) —> 0. 

The forward finite difference in time and backward difference in space suggest the following update 
rule: 

Uij+i = (1 - m)Uij + mUi-ij, (23.149) 

using m = For this update rule, the value at time (j + l)At depends on the small interval iAx, (i — 
1) Ax at time j At, which recursively generalizes to a triangle of dependence as illustrated in Figure 23.8. 
Comparing with the analytical solution, u(x, t) = u(x — at , 0), we realize that the analytical solution can 
only be properly approximated by this update rule if a > 0, and further only if m < 1. This is the so-called 
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t = jAt 



to 


Figure 23.8: The forward time and backward space approximation of the linear advection equation gives 
a triangular domain of dependence. 


CFL condition [Courant et al., 1928], that is, that the flow of information should lie within the numerical 
triangle of dependency. The CFL condition is a necessary, but not sufficient, condition for convergence. 

The Fourier analysis of (23.149) is again based on the guess that 


Uij = X J exp(F i A x\[—\) 


(23.150) 


is a solution, and we insert this guess into (23.149), use the recursive formulation, = A Uij, divide 

by Uij , which gives 


^Ui j , , Ui j 

= (! — 

4 


+ m- 


Ui- 


j-hi 
Ui 


hi 


A = 1 — m + mexp (k{i — l)Ax\/^T — kilAx\J — l) 
= 1 — m + mexp (— kAx\/~ l) ■ 


(23.151a) 

(23.151b) 

(23.151c) 


To find the Fourier convergence, we use the length of a complex number 


a + V^l[3\ = \/« 2 + P 2 


(23.152) 
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Figure 23.9: The function 1 — 4(1 — m)m in the interval [—1,1]. 


and evaluate the squared length of A as 

|A| 2 = |1 — m + mcos(kAx ) — mv/^Tsin(fcAx)| 2 (23.153a) 

= (1 — m + mcos(kAx)) 2 + sin(kAx)) 2 (23.153b) 

= (1 — m) 2 + 2(1 — m)m cos(kAx) + m 2 cos 2 (FAx) + m 2 s'm 2 (kAx) (23.153c) 

= (1 — m) 2 + 2(1 — m)m,cos(kAx) + m 2 (23.153d) 

= 1 — 2m + 2 m 2 + 2(1 — m)mcos(kAx) (23.153e) 

= 1 — 2(1 — m)m + 2(1 — m)mcos(kAx) (23.153f) 

= 1 — 2(1 — m)m( 1 — cos(fcAx)). (23.153g) 


Since the range of cos is [-1,1], we conclude that for | A| < 1 is equivalent to requiring that 1—4(1 —m)m < 
1. This is a quadratic function illustrated in Figure 23.9, and we conclude that it is sufficient to require that 
m < 1 as also indicated by the CFL condition. This could suggest that the full interval of a / 0 would be 
covered by a central differencing scheme in space, that is, 


U, 


i,j+l 


-u, 


h3 


At 


+ 0(At) = -a 


^i+l,j Uj ,—ij 

2Ax 


+ 0( Ax 2 


(23.154) 


but using (23.150) it may be shown that this scheme has |A| > 1 and does therefore not converge. The 
best choice is therefore to use the upwind scheme, where a backward differencing scheme in space is used, 
when a > 0, and a forward difference scheme in space is used, when a < 0, that is, 


l — 


(1 - m)Uij + mUi- ij, 
(1 + m)Uij - mUi- ij, 


when a > 0, 
when a < 0. 


(23.155) 


Just as for the case of backward difference in space, when a > 0, it may equally be proven that rri < 1 
is sufficient for convergence of the forward difference in space. Further investigation into the truncation 
error will show that the upwind scheme always has an amplitude in the order of kAx and a relative phase 
error, which vanishes at kAx = \ (see [Morton et al., 1994, Chapter 4.4] for the derivation). 
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Figure 23.10: Comparing the Lagrangian and the Eulerian method for moving an interface: an interface 
is moved along an external vector V. 


23.3 Level-Set Methods 

The motion of particles moving in an externally generated velocity field is given as 

(23.156) 

where r * and v r arc the position and velocity of particle i at time t. This is a Lagrangian view of the dynam¬ 
ical system discussed in detail in Chapter 8, and it is very useful for animating systems where the configu¬ 
ration of the particles is fixed or nonexisting. For objects that can change topology, such as water that may 
separate into droplets, the splitting and merging process becomes complicated [Mclnerney et al., 2000], 
When topology may change, it is more natural to define the object boundary as an implicit function, u, 
and apply the level set equation [Osher et al., 1988, Sethian, 1999, Osher et al., 2003] 


ut + v ■ V x u = 0, (23.157) 

where u is an implicit function such that 

u(x, y, z, t) = 0, (23.158) 

defines the interface between, for example, water and air, and v is now a velocity field sufficiently smooth 
near the interface. The equivalence of (23.157) with (23.156) is shown in Figure 23.10. The figure shows 
a particle at its initial position in r(to), and later after the application of the velocity vector v, resulting 
in position r(ti). It is assumed that the time-step is one unit of time. Equivalently, the implicit function 
u(x,t) intersects the horizontal axis at u(r,to) = 0 initially, and since the function is locally linear, the 
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addition of —vV x u will cause the interface to move to u(r\ t \) = 0. We thus conclude that the level-set 
equation (23.157) implements (23.156), when the implicit function is linear in a band vV x u around its 
interface. 

The most used implicit function is the signed distance map, which for an object with boundary SO is 
given as 

d(r) = (—l) / ( r ) min ||ro — r|| , (23.159) 

ro 

where tq e 50 is a point on the interface and I(r ) is the indicator function. 


I{r) 


1, if r is inside the object, 
0, otherwise. 


(23.160) 


The signed distance map is numerically advantageous since = 1 in almost all points. It is very 

convenient for collision detection, since sgn d can be used to detect intersections between objects. If 
d(r) < 0, then d(r)V x d{r) is the shortest route out to the boundary of the object, where dir) = 0. 

In contrast to the Lagrangian formulation, where an interface is represented directly as a set of control 
points or particles, the level-set approach depends on the spatial sampling of the implicit function and 
speed vector field. This is however, analogous, since finer sampling in space achieves finer representation 
of the interface, just as adding more particles does in the Lagrangian formulation. Methods also exist 
for sparse representation of the implicit function, such that the memory requirement can be reduced to be 
near that of the Lagrangian system. For some systems, v is only determined on the interface, that is, at 
d = 0, and for the level-set method we need to extend the velocities at least to a band around d = 0. Since 
the interface is moving in the normal direction, the most natural solution is to impose constancy in the 
velocity field along the normal. However, setting v(r) to be equal to the velocity of the nearest point on 
the interface seems to preserve signed distance maps. 

We will analyze the simplest case of one space and time dimension. 


U t (x,t ) = -v ■ u x (x,t ), 


(23.161) 


This equation is the linear advection equation (23.145), which holds for (23.157) in general, hence we 
must consider the directions of the characteristics. Assuming a grid (7 (J ~ u(iAx,jAt), a forward finite 
difference in time and an upwind scheme in space gives 


U, 


hj +1 


-u, 


h3 


Ui+1 i Ui ’ j , if u, 


At 


Ax 
Uij-Ui- 


Ax 


hi. 


h3 — 

otherwise. 


The truncation error is found similarly to (23.148c) to be 


(23.162) 


T(x,t) = + 


v=^-^u{x,t) 


if Uij < 0, 
otherwise. 


+ 0(At 2 + Ax 2 ) 


(23.163) 


Likewise, we find the CFL conditions using (23.153g) to be 
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for constant positive velocities, and 


At < 


Ax 

max x ||n 


(23.165) 


in general. 
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In this section we will present the classical theory of nonuniform B-splines and simultaneously we will 
derive a c-style pseudocode for efficient implementation of the theory. 

The theory of B-splines is a much wider area than the treatment we give in this paper. Therefore, we 
encourage the reader to look at our references [Piegl et al., 1995, Watt et al., 1992, Hoschek et al., 1993, 
Boor, 1978, Boehm, 1980, Farm, 1993, Erleben et al., 2003a] if a wider coverage is wanted. 

24.1 The B-Spline Basis Functions 

A B-spline is a parameterized curve in the parameter u. It can be evaluated as the interpolation of a set of 
basis functions. Therefore, before we look at the B-splines we will describe the B-spline basis functions. 
We’ll start by defining the normalized basis function for the B-spline. 

Definition 24.1 

The normalized basis functions, N, are defined recursively by the Cox de Boor definition. For k > 1: 


U — tj tj-Ll- — u 

N ilk = - - —Ni , k -1 + - l+k - JVi+r.fc-!, 

1 H H+l 


and for k = 1: 


Ni, t = 


1 if ti < u < t i+ i , 
0 other-wise. 


(24.1) 


(24.2) 


The index k is called the order of the basis function. For a given value of k this definition results in a 
polynomium of degree k — 1. All the tiS are called knot values, and are usually arranged in a so-called 
knot vector, T: 

T = [... ,ti-i,ti,t i+1 ,t i+ 2, ■ ■ -} 1 ■ (24.3) 

To better understand the definition (24.1), we will consider a value u that lies between two knot-values 


11 ' u tj- pi- 


(24.4) 


Looking at the definition of the B-spline basis functions (24.1), we can easily derive the dependency table 
shown in Table 24.1. The table should be read bottom-up. Stalling at the bottom with the knot vector, one 
can track all nonzero basis functions up to any wanted fe-value. Observe that Nj + yi is zero due to (24.2). 
The pattern is obvious and we could easily have extended the table to any value of k. 

There arc three properties, which can be seen immediately from the table and which we arc going to 
put to practical use later on. 
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k — 3 0 IV'i—2,3 Ni— 1,3 0 

k = 2 0 W— 12 0 

fc = 1 ... _0__0_^ 

ti-i ti U ti +1 /'i+2 


Table 24.1: Dependency table showing only nonzero basis functions at the value of u. 


1. At the k’th order there arc exactly k basis functions, which arc different from zero. 

2. From the table we can derive that if 

ti 5; U Zi-f-1 ■ 


then only 


Ni—k+l,ki ■ ■ • i 


(24.5) 

(24.6) 


will be nonzero. 

3. Computing the basis functions bottom-up, instead of top-down, will make it possible to reuse pre¬ 
viously computed results. 


We will use properties one and two to make a sort of fast rejection on which basis functions we need to 
compute. The third property is obvious from the table, but it requires some work before it can be applied 
efficiently in a practical implementation. 


24.1.1 Implementing the Basis Functions 

Let’s look at an example. Assume we have 


ti <u < t i+ 1 , (24.7) 

Then, by the definition of the basis functions, and the properties one and two, we can write up all the 
nonzero basis functions: 


k = 1: 

k = 2: 


N iA = 1. 


(24.8) 


Ni- 1,2 = 


Ni, 2 = 


U ^ + ? N iA = 


tj+0 ti —1 

U — t 


tj+1 ti 


kj ■ 1 tj. 


Ni. i + 


U+2 — U 


ti+2 1 




i+1,1 


tj+1 

ti+1 ti 
U-tj 
tj+1 ti 


Ni, i, 

/v u - 


(24.9a) 

(24.9b) 
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k = 3: 


at _ U ti—2 j. T , ti+1 u _ t'i+1 U *■]- 

Ai- 2,3 — 7-7--iVj_ 2,2 + 7-7-— ---Aj_i )2 , 


Ni- 1,3 = 


/*+0 / j—2 

U - L-l 
/*+l ti— i 


/j+1 ti— 1 


/j+1 1 


^Vj-1,2 + 


/j+2 ~ 'M 
ti+2 ti 


Ni 


i. 2 ? 


U ti ^ 7_|_3 U U ti 

Ni t 3 = 7-7~/V^i,2 + 7-7--^i+1,2 = T-r-^i, 2- 


/j+2 L: 


/i+3 — /i+1 


/i+2 ti 


(24.10a) 

(24.10b) 

(24.10c) 


We can immediately make a few observations. For instance, if we look at Ay__ | 3 , then we observe that 
the part -———Ay 2 of the second term reappears in the first term of Ay 3 . Another observation we can 
make is that the first paid of Ay_ 2 , 3 is always zero and the same goes for the last paid of Ay 3 . This is seen 
directly from the properties observed from Table 24.1. These two observations can be used to speed up 
computations. 

Looking at the fractions of the knot differences, we see that they appeal - to be similar for increasing 
values of k. This similarity can be systematized by introducing two new auxiliary functions called left and 
right. These are defined as follows: 


1 

+ 

-+o 

1 

3 

II 

<£h 

(24.11a) 

right(j) = t i+j - u. 

(24.11b) 

Rewriting the example for k = 3, using the auxiliary functions yields: 


at _ left (3) AT right (1) 

* 2,3 right(0) + left(3) * 2,2 right(1)+left(2) 1 1 ' 2 ’ 

(24.12a) 

,r left (2) . right (2) Ar 

1 1,3 right(1) - left(2 ) N% 1,2 + right(2) + left(l) N ' ,2 ’ 

(24.12b) 

AT _ left (!) AT , ri g ht ( 3 ) 

*’ 3 right (2) + left(l) 1,2 right (3) + left(0) 1+1,2 

(24.12c) 

From this, we can derive the following general relation: 


Af (r, /c) (fc—r— l),k 

(24.13a) 

left (k - r ) 

right(r) — left (— r) 1 

right (r + 1) 

right(r + 1) + left(/c — r — 1) l,k 

(24.13b) 

where 


0 < r < k. 

(24.14) 


By close examination of (24.9a), (24.9b), (24.10a), (24.10b), and (24.10c) from earlier, we notice that 
when computing a new k’ th level, the only left and right values not already computed are left (A;) and 
right (A). We are now ready to write up pseudocode for computing the values of all nonzero B-spline basis 
functions A^ (see Figure 24.1). 
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algorithm BasisFunc(i,u,K,T) 
left = array(K+l) 
right = array(K+l) 

M = array(K) 

left [ 0] = u - T[1+1] 

right [0] = T[i] - u 

left [ 1] = u - T[1] 

right[1] = T[1+1] - u 
M [ 0 ] = 1 

For k = 2 to K do 

left[k] = u - T[i+l-k] 
right[k] = T[i+k] - u 

saved = 0 

For r=0 to k-2 

tmp = M[r]/(right[r+1]+left[k-r-1]) 
M[r] = saved + right[r+1]*tmp 
saved = left[k-r-1]*tmp; 

Next r 

M[k-1] = saved 

Next k 
return M 
end algorithm 


Figure 24.1: Pseudocode for computing nonzero B-spline basis functions. 


24.1.2 The Derivatives of the Basis Functions 

It will prove useful to us later to continue studying the basis functions, as it turns out the derivative of a 
B-spline can be written as the interpolation of the derivatives of the basis functions. Therefore, we seek a 
convenient and tractable method for computing the derivative of the basis functions. 

We will start by showing the result of computing the first-order derivative of a basis function. 

Theorem 24.1 (Derivative of B-Spline Bases) 

The first-order derivative of a B-spline basis function is given by 


dN it k 

du 


k - 1 


ti+k—l H 


Ni,k -1 - 


k- 1 


ti+k l 


-Ni 


i +1 


i+l.fc-l- 


(24.15) 


Proof of Theorem 24.1: 

We will prove the equation by induction on k. Setting k = 2 we see by direct differentiation of (24.1) that 
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the derivative would be either 


-1 


ti+k—1 ti+k ti +1 


or 0 


(24.16) 


depending on what interval u lies within. By direct substitution into (24.15) we can easily verify that 
(24.15) is true in the case of k = 2. Assume now that (24.15) is true for k — 1 and that k > 2. To prove 
that (24.15) is true for the fc’th case, we start by applying the product rule to (24.1) and get the following. 


dNj 


i,k 


i 


dn 


ti+k—1 H 


Nik- l— 


1 


ti+k /f+l 


-Ni 


i-\-l,k— 1 


^ u ti dNi^—i ^ ti-\-k ^ l 


We substitute (24.15) for the terms 

dNi^ _ 1 


^i+k—l t 

dNi k—l 


du 


^i+k ti +1 


d'U 


du 


+ 


ti+k— 1 ti 
U — ti 


—-and 

du 

Ni,k -1 ~ 


dN, 


i+1,k 1 and get the following, 

-Ni+ l,fc—1 


du 

1 


ti+k /»+1 


ti+k— 1 ti \ti-\-k—2 t 


k -2 


k — 2 


+ 


ti+k tr 


A: — 2 


ti+k fj+1 \ti+k— 1 fj+1 

By reai'ranging terms we get 

d/Vifc 1 


Nifc-2 ~ 

ti-\-k— 1 H+l 

Ar A: - 2 

— A, +1)fc _ 2 - --— 


du 


-Ni 


1 


2,A— 1 


+ 


+ 


ti+k —1 t{ fj+fc fi+1 

A; — 2 u — ti 

---/V* ifc _ 2 

ti+k— 1 ti ti-\-k—2 ti 

k — 2 f t i+k - u _ 
i+k—l ti -\-1 fi+1 


j+fc n+2 


t^i+l,k— 1 


A/i+l,fc-2 

— ZVj+2,fc-2 


U — ti 


ti+k— 1 ti 


N 


i+l,k—2 


k-2 t i+k - u 


N 


i+2,k—2- 


ti+k H+l ti-\- k ti -\-2 

We perform a mathemagical trick. We add zero to the equation by adding 1 and subtracting 1: 

1 1 


du 


+ 


Zf+fc—l ti 

k-2 


Nifi-i ~ 
u — ti 


ti+k ti +1 


-iv. 


i+l,fc—1 


ti+k— 1 ti /.j_(_fc_2 Z ? ; 


-Nib- 


i,k —2 


+ 


k-2 


ti+k U 


ti+k— 1 Zj-)_i Zj_)_i 

A' 2 t+\- k U 


-1 + 1 - 


U — ti 


ti+k— 1 ti 


N, 


i+l,k— 2 


-AA 


i+2,fc—2 • 


(24.17) 


(24.18) 


(24.19) 


ti+k Zj-(-1 tj+fc Zj_)_2 


(24.20) 
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Expressed as fractions, we get: 


dN it k 

du 


1 


+ 

+ 

+ 


ti+k—1 / 

k- 2 


-+fc-1 - 
u — tj 


ti+k—1 ti ti+k—2 ti 


ti+k ti +1 
-Nik-2 


-N, 


2+1,/c—1 


k -2 


ti+k tt /i+fc /j+1 


ti+k— 1 Z*+l \ti+k ti +1 Zj+fc /j+1 
tj+k— 1 tj H 


ti+k—1 /j+fc—1 ti 

k 2 /j+fc rr 


iV, 


i+l,fc—2 


/j+fc ij+1 tj+k /j+2 


iv. 


i+2,k-2- 


Setting on common denominators, we get: 


du 


1 


Ni t k-i ~ 


1 


+ 


tj+k— 1 ti +-fc /j+1 

A; — 2 u — ti 


-N 


i+l,fc—1 


^2+/c—1 ^2 ^i+k—2 


-Nik-2 


+ 


fc - 2 


ti-\-k— 1 /j+1 


ti+k tt ti+k 4“ #1+1 


tj+k t /+1 


+ 


iV, 


ti+k— 1 ti U -\- ti 
ti+k— 1 — ti 
k~ 2 tj + fc - U 
ti+k /i+1 ti+k Zi+2 


2+1,/c—2 


-iv, 


i+2,jfc-2- 


(24.21) 


(24.22) 


Reduction yields: 


d/V,- 


i, k 


i 


du ti+k—l ti 
k -2 


+ 

+ 


JVi.k-1 - 

u — tj 


ti+k ti +1 
-Nik-2 


-Ni 


2+1,/c— 1 


^2+fc—1 ti ti-\-k— 2 — ^2 

/u 2 / ^2+1 U ^ ti+k— 1 U 

ti-\-k— 1 ^2+1 \^2+fc ^2+1 ^2+fc—1 ^2 

k 2 ^2+fc ^ 




2+1, k —2 




2+2, k— 2* 


^2+fc ^2+1 ^2+fc ^2+2 


(24.23) 
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Now we arc almost at our goal. By rearranging the equation once more, we get: 

dN hk 1 1 


du 


+ 


ti+k— 1 11 

k - 2 


Ni,k -1 - 


ti+k li +1 


-N, 






U ti „ T - U 

-Aa- 2- , Ani.A;—2 




k — 2 


ti-\-k ^i -\-1 1 ^i+1 


“ ti+1 iV w _ 2 + / i+fc / iW- 2 ). (24.24) 


ti+k ti +2 


Looking at the terms in the parentheses, we immediately recognize (24.1), which we apply to get the 
following: 


dNi 


i.k 


l 


du 


-N, 


1 


i.k— 1 


Ni+l,k-l 


+ 


ti+k—1 t 

Cleaning up a bit more we get: 

dNi k (k — 2) + 1 


ti+k— 1 I'i t-i+k ti+1 

k-2 AT k- 2 Ar 

- 7-7-^*+1,*:-! 


ti+k ti +1 


(24.25) 


du 


ti+k— 1 H 
k- 1 


Finally, we can conclude that (24.15) is true. 


JV,*-, - 

W+fc W+l 

fc — 1 

dd%,k —1 7 7 ^i+l,fc—1- 

tj+fc — t*+l 


(24.26) 

□ 


Now let us tty to look at higher-order derivatives. If we differentiate (24.15), we get: 

d 2 iV»,fc _ k- 1 dN i)k _x _ fc - 1 djVt + i,fc-i 

du? tj, du ^i+fc ij+1 dn 

From this, it is not hard to see that a general formula can be written for the j’th derivative: 

d j N itk = k - 1 d^N^. i _ fc - l d J ’- 1 iVt-|- 1| fc- 1 

dw> t i+k -i-ti dui~ l t i+k -ti + 1 diF -1 

Our results can be generalized into the following theorem. 

Theorem 24.2 (Higher-Order Derivatives of B-Spline Bases) 

For j < k the higher-order derivatives of a B-spline basis function is given by 


d j N kk 

dui 


(fc-1)! 

(■ k~j~ 1)! 


p =o 


® J /pd^i+p,k—j ) 


(24.27) 


(24.28) 


(24.29) 
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with 


«o,o — 

1, 

(24.30a) 

a i,o = 

dj- 1,0 

(24.30b) 

c-f» 

+ 

?r 

1 

1 

®j,p — 

Qj—lfP Q>j—l,p—l 

(24.30c) 

ti+k+p—j — t-i+p 

a 3,j = 


(24.30d) 

H-\-k ti+j 


Proof of Theorem 24.2: 

We will now tty to prove (24.29) by induction on j. We’ll start by examining if the equation holds for 
j = 1. From direct substitution into (24.29) we get the following result: 


dN i>k _ (fc - 1)! 
du (k — 2)! 

= (k~ 1) 

= (k- 1) 
k- 1 


(ai,oNi,k-i + oi.iJVi+i.fc-t) 
«0,0 A7 , —«0,0 


t-i+k—l / 

1 


ti+k /i+l 

1 


-K 


/j+fc—t / 


— 

ti+k— 1 ti ti+k /»+l 

A: — 1 

d^i,k— 1 ~ 7 ^i+l,fc—!■ 


i+l,fc—1 


-TV, 


i+l,fc—1 


^i+k ti+1 


(24.31a) 
(24.31b) 
(24.31c) 
(24.3 Id) 


We immediately recognize (24.15) and conclude that (24.29) is true for j = 1. Now, we’ll assume that 
(24.29) is true for the case of j = j — 1 and we’ll try to prove that it also is true in the j'th case. We have 


dP~ l 


(k~ 1)! 


(k ~ U - 1) - !)• 


3~ 1 

^ " Q>j—i,pNi-\-p k—j+i- 


(24.32) 


p=o 


If we differentiate this once more we get the j’th derivative, that is, 


d?Ni t k _ {k — 1)! dNi + p^-j+i 

- (k — (j — 1) — 1)! ^ Q aj ~ hp Tu 

Now let us apply (24.15) to the derivative inside the summation: 


d>N iik 

dP 


(*-1)1 V 

(k-j)'-h0 

k~j 


k~j 


t-i+p+k—j /j+p 
^i+p+l,k—j 


-N, 


i-\-p,k—j 


(24.33) 


H-\-p+k—j+1 ^i+p+1 


(24.34) 
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Rearranging yields: 


d 3 ^i,k _ (k — 1)! ( a j-l,p nr 

dV {k — j — 1)! “ \t i+p+k -j - U+p t+p,k ~ 3 

a j~l,P AT \ 

, ^l+p+l,k-j I • 

H-\-p-\-k—j+l ^2+p+l / 

Let’s write out the summation into explicit terms. This results in the following: 


(24.35) 


dPN iik _ (.k - 1 )! ( aj-ifi 

dP {k-j-1)1 \t i+k -j - U 

_L __ AT. . . . 




_ Qj-1,0 

ti+k—j -\-1 /j+1 

°±PA _ AT. .. 


H+l,fe— j 


+ 7- J JVi +1 ,fc-j - 7 - J ^i+2,fc-j 

n+l+fe—j H-fl W+fc—j+2 ^i+2 


a i“l,2 


a i-i,2 


, iv i+2,fc-j , -‘ v *+3,fc—j 

^2-|-2-|-/c—jl ^2+2 I’i.A-h —7-1-3 ^7.-1- 3 


i+fe—j+3 ^i+3 


I AT- _ 1U 

H+k—1 H+j—l L i-\-k H-\-j 

This equation can be rewritten by collecting terms of equal iV-factors: 


(24.36) 


d j N i<k _ ( k - 1)! / aj-i,o N 

dP (k-j- 1)! \t i+k -j - U l,k ~ 3 

, - a i-T0 AT 

+ 7-“7- Ali+l,k-j 

T'i+l+k—j ^i -\-1 


Qj-1,2 a j-l,l 
^i+2+fc—j ^2+2 


’i+2,fe—j 


a J-1J-2 
^i+k— 1 — ti+j— 1 
a i-Ti-i A7 . ^ 




ti+k ti+j 


s i+j,k-j 


(24.37) 


Looking at how the a-coefficients in (24.30a)-(24.30d) arc defined, we can easily rewrite the equation 
into the following form: 


d?N i}k (k - 1)! 


dP (k-j- 1)! VJ ’ U 

• • • + a,jjNi+j tk —j) , 


ifljfiNi k—j T q_j,l/Vi+l,fc —j T a ].2^i+2,k—j T 


(24.38) 
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Ni,! 

Ni- 1,2 

i 

to 

CO 

Ni- 3,4 

1 ti 

Ni, 2 

Ni- 1,3 

Ni- 2,4 

ti+l ti—1 

ti- 1-2 ti 

Ni, 3 

Ni- 1,4 

^i+1 ti—2 

ti+2 ti— i 

ti-\- 3 

Ni, 4 


Table 24.2: A/-table example for k = 4. 


which conveniently can be written as: 


d j N ijk _ (k - 1)! 


dt j {k-j- l)!£r 0 


yy a j,pNi 


i+p,k—j- 


By comparison with (24.29), we can now conclude that it also holds in j’th case. 

24.1.3 Implementing the Derivatives of the Basis Functions 

Now we’ll turn our attention toward making an efficient implementation. By (24.29) we have 


dPN itk _ (k- 1)! 


dui (fc-j-l)!^ 


yy a j,pNi 


i+p,k—j j 


j < k. 


and we see that the numerators in (24.30b)-(24.30d) are 

/AT — ti-\- k -\-p—j foi 0<p<j. 


(24.39) 

□ 


(24.40) 


(24.41) 


By (24.1) we have 

x,= - " ~ U+r _-' -(V j+r -u-, + 1 V W -j. (24.42) 

H+k+p—j—l H-\-p— 1 ti+k+p—j ^i+p 

Now, if we compare the numerator of the second term to (24.41), we immediately see that they are iden¬ 
tical. In other words, we have seen that the knot differences used to compute the iV-values can be reused 
in the computation of the derivatives. Weil create a table, which we call M, and for 0 < r < k we store 
values into M as follows: 


(24.43) 

(24.44) 

The diagonal is special since we will store Nj^ k in it. In Table 24.2 we have given an example of an 
M -table to help familiarize you with it. Observe that M contains all the nonzero values of N from the 
dependency table and it also stores the corresponding knot differences (from the right terms only) in the 
transpose position of the corresponding N value. 


Ni—k+l+r,k * M r k— 1: 

AT —> M k _i r . 
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The knot difference we need in the a-coefficient of N i+p ^_j is computed by N i+p _i k-j+i- I n the 
M- -table, N i+p _i^_j + \ would lie just to the right of N r+p j. r Knowing this, we can easily come up with 
the following rule for looking up the knot difference we need, when we want to compute the a-coefficients. 
The rule is as follows: 

1. Get the transpose position of the corresponding A-value. 

2. Add one to the row index. 

If we compute the derivatives in succeeding order starting at 0, then 1,2, ...j — I. j, and we observe 
that when we compute the j’th derivative, we only need the a-coefficient values, which we found in the 
computation of the j — l'th derivatives (and the knot differences of course). This means that we only have 
to remember the a-coefficient from the previous iteration. Using a cyclic memory buffer makes it possible 
to implement a very efficient way of computing the a-coefficients. 

Looking closely at (24.29), we see that some of the terms involved in the summation could potentially 
have a zero A-value. In an implementation, we could take advantage of this knowledge as follows: first 
we split the summation into three parts, each paid corresponding to the three different ways of computing 
the a-coefficients (equations (24.30b)-(24.30d)). That is, we arc going to use the following index values 
for the A—values: 

* — k + 1 + r for ayo (24.45a) 

i — fe + l + r + ltoi — k + l + r + j — lfor ayi to ajj -1 (24.45b) 

i — k + I + r + j for ajj. (24.45c) 

Now, since all these A-values arc of order k — j we know that there arc at most k — j nonzero A"- values 

and we also know that their index range is 

i — k + j + 1 to i. (24.46) 

In other words, if 

i — k + l + r>i — k+j + 1, (24.47a) 

r > j, (24.47b) 

then the ayo term have a nonzero A-value and should therefore be included in the summation. Similarly, 
we see that if 


i — k + l + r + j <i, (24.48a) 

r < k — 1 — j. 


(24.48b) 
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Then the ajj term should be included as well. The a ] V terms (for p = 1 to j — 1) arc handled a little 
differently. In this case, we want to derive the range of p-values, p m in, ■ ■ ■ ,Pmax , which results in nonzero 
values. First, we look for a lower limit. If 


i — k+l+r+l>i— k + j + 1, 
r + l>j, 


(24.49a) 

(24.49b) 


then pmin should be set to 1. If not, we have to solve for p m in, such that we find the lowest value of p m i n 
where: 


i-k + l + r + Pmin >i - k + j + 1, 

(24.50a) 



Pmin j Ti 

(24.50b) 



Pmin = j T. 

(24.50c) 


In a si mi lar way we find p m ax • If 

i — k+l+r+j— 1 < i, 

r <k- j, 

then Pmax = J — 1. If not, then we need the biggest value of p max , such that 

i-k + l+r + Pmax < i, (24.52) 


(24.51a) 

(24.51b) 


which is Pmax — k — 1 — r. 

This concludes our walk-through of implementation details, and we can now show the pseudocode of 
the entire algorithm. In Figure 24.2 pseudocode is shown for initializing the M- -table; this initialization is 
used by the algorithm for computing the derivatives shown in Figure 24.3. 

24.2 The B-Spline 

Having taken care of the basis functions, we arc now ready for the actual B-spline. 

Definition 24.2 

A nonperiodic, nonuniform, k-order B-spline is defined by 


C(u ) = ]T N itk Pi 

i =0 


(24.53) 


© 


© 


© 


© 
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algorithm initializeM(i,u,K,T) 
left = array(K+l) 
right = array(K+l) 

M = array(K,K) 

left [ 0] = u - T[i + 1] 

right [0] = T[i] - u 

left [ 1] = u - T[1] 

right[1] = T [i + 1] - u 
M [0] [0] = 1 

For k = 2 to K do 

left[k] = u - T[i+l-k] 
right[k] = T[i+k] - u 

saved = 0 

For r=0 to k-2 

M[k-1][r] = (right[r+1]+left[k-r-1]) 
tmp = M[r][k-2]/M[k-l][r] 

M[r] [k—1] = saved + right[r + 1]*tmp 
saved = left[k-r-1]*tmp; 

Next r 

M[k-1][k-1] = saved 
Next k 
return M 
end algorithm 


Figure 24.2: Initialization of M-table. 


where 


a < u < b 


(24.54) 


The n + 1 points {Pi\i = 0,..., n} are called the control points and the functions = 0,..., n} 

are the k’th order normalized basis functions, which were treated in the previous sections. 


These are defined on a knot vector T, such that 

T = [a, ...,a,t k ,...,t n ,b, ...,b] T . (24.55) 

k k 

Notice that the knot vector has a total of n + k + 1 elements and the first ^-elements all have the same 
value and the last k elements also have the same value. Succeeding values in the knot vector should be 
nondecreasing. 


© 


© 
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algorithm derivatives(i,u,K,T,J) 

D = array (J+1,K) // D[j] [r] = j'th derivative of Ni-k+i+r,k 

a = array(2,K) 

M = initializeM(i,u,K,T) 

For r=0 to K-l 

D [ 0 ] [r] = M[r] [K-l] 

Next r 

For r=0 to K-l 

si = 0, s2 = 1, a [0] [0] = 1 
For j=l to J 
djN = 0 

If r >= j then 

a[s2][0] = a[si][0]/M[K-j][r-j] 
djN += a[s2] [0] * M[r-j] [K-j-1] 

End if 

If r + 1 >= j then pMin = 1 
Else pMin = j-r End if 
If r <= K -j then pMax = j - 1 
Else pMax = K - 1 -r End if 
For p=pMin to pMax 

a[s2] [p] = (a[si] [p]-a[sl] [p-1])/M[K-j] [r+p-j] 
djN+=a[s2][p]*M[r+p-j][K-j-1] 

Next p 

If r <= (K-l-j) then 

a[s2][j] = -a[sl][K-l]/M[K-j][r] 
djN += a[s2][j] * M[r][K-j-1] 

End if 

D[j][r] = djN 
swap(si,s2) 

Next j 
Next r 

factor = K-l; 

For j=l to J 

For r=0 to K-l 

D[j][r] = factor * D[j][r] 

Next r 

factor = factor * (K-l-j) 

next J 
return D 
end algorithm 


Figure 24.3: Pseudocode for computing derivatives of the B-spline basis functions. 
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algorithm curvePoint(u,K,T,P) 

i = knot index, such that ti < u < ti +1 
N = BasisFuncs(i,u,K,T) 

C = 0 

For r=0 to K-l 

C = C + N[r]*P[i-K+l+r] 

Next r 
return C 
end algorithm 


Figure 24.4: Pseudocode for computing a point on a B-spline. 


To obtain the derivative of a B-spline, we differentiate (24.53) with respect to u. By the product rule 
we get 

n 

c\u ) = ]T N' tk Pi + N hk p' v (24.56) 

*=o 

Since {Pi} arc all constants and independent of u, this reduces to 

n 

C'(u) = J2 N’, k Pi. (24.57) 

i=0 

From this, it is easily seen that 

n 

C (j \u) = J2 N{j \i,k)Pi. (24.58) 

i =0 

Recalling our very first algorithm for computing the basis functions discussed in Section 24.1.1, we 
can easily derive an algorithm for computing a single point on a B-spline. This is shown in Figure 24.4. 
From (24.58) it is not hard to see how we can put our previous algorithm for computing the derivatives of 
the basis functions (see Section 24.1.3) to practical use. This is shown in Figure 24.5. 

24.3 Global Interpolation 

In this section, we will look at the inverse problem of computing points on a B-spline. Instead, we will 
try to compute the B-spline passing through a set of points on it. The approach we will take is called 
global interpolation. We call the initial given set of points for break points or data points and introduce 
the notation {Tf ,} for them. The indices have the meaning that if j < i. then we should encounter X j 
before X , if we walk along the resulting spline as the parameter u increases. 

With the introduced formalism, we can state our task at hand quite simply: given a set of break points 
and a knot vector, we need to find the control points {Pi} of the nonuniform B-spline, which passes 
through all the break points. Assume that we have a knot vector consisting of the following knot values 

Z()j > ^2) • • • j tk—li tki • ■ ■ j tfii i'l)-■ 1 j tn+2 j ■ ■ ■ j tn+ki (24.59) 
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algorithm curveDerivatives(u,K,T,P,J) 
i = knot index, such that ti < u < ti+i 
J = Min(J,K—1) 
dC = array(J+l) 
dN = derivatives(i,u,K,T,J) 

For j-0 to J 
dC[j] = 0 
For r=0 to K-l 

dC[j] = dC[j] + N[j][r]*Pli-K+l+r] 
Next r 
Next j 
return dC 
end algorithm 


Figure 24.5: Pseudocode for computing the derivative of a B-spline. 


where 

to — ■■ ■ ■ 

^k— In 

(24.60) 

and 

tn+ 1 = • • • 

t"n+k' 

(24.61) 

Given such a knot vector, we know that 

C(t fc _ t) 

= Po, 

(24.62a) 


C(t n+ 1) 

= Pn • 

(24.62b) 


Each (unique) knot value will correspond to a single break point. From this we see that we must have a 
total of n — k + 3 break points: 


X 0 ,X 1 ,X 2 ,... , X n _k+i,X n -k+2- (24.63) 

For each of these break points we will require that 

C(ti + k-i) = Xi for 0<i<n — k + 2. (24.64) 

That is, we have a total of n — k + 3 equations, but we have n + 1 unknowns, and a spline with n + k + 1 
knot values must have n + 1 control points. In short, we might need a few more equations in order to 
solve our problem. Typically k = 4; this results in a fairly inexpensive spline computation with acceptable 
flexibility. When k = 4 two more equations arc needed to solve for the control points. We could simply 
pick 

Pq = Pi, and P n = P n _ 1 . (24.65) 

as the two extra equations. Now we can solve the system of linear equations given by (24.64). This 
strategy could be applied for an arbitrary value of k. However, we choose to work with k = 4; the reader 


■e 
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N i 


,4 







Figure 24.6: The computation of the basis function. 


should be able to generalize from our derivations. Let us set the value of k to 4 and see if we can derive 
an efficient way of solving (24.41). We start out by writing up Cf/j+s) as 


C(t l+3 ) = Y PiNi , 4 . (24.66) 

i =0 


Only k = 4 of the basis functions arc potentially nonzero, since they have a support that overlaps with 
ti+ 3 . That is, 


C(ti+s) — PiNi,A — P )/Yi,4 + -Pj+l/Vi+1,4 + Pi+'2Ni+2,4 + ^+3-^+3,4- (24.67) 

i=0 


Now let us examine each of the 4 iV-terms. We start by looking at By our definition of a basis 

function (see (24.1)) we can construct the tree shown in Figure 24.6, which shows how N t ^ is computed. 
Since we know that should be evaluated at t t+ s, we can easily write up an explicit expression for 
Ni, 4 . From the tree in Figure 24.6 we see that it would be the terms along the path from the rightmost leaf 
up to the root of the tree, because 7Vj + 3,1 = 1 is the only nonzero leaf of the tree. 


N 4 = 


Zf+4 /i+3 \ ( /i+4 /i+3 


ti+4 ~ t 


i +1 


ti +4 /i+2 


f iti+4 ti+ 3 ) AT 

77 - 7 — 7 yv *+ 3 ’ 1 

\(ri+4 — H+ 3) 


(#1+4 /i+ 3 ) 


(ti+4 /i+l)(/i+4 /»+ 2 ) 


(24.68a) 

(24.68b) 
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If we draw si mi lar frees for iVj + 2 , 4 , and A/,;+ 3.4 and evaluate them at t, +: >, we can derive the 

remaining expressions we need, 


A/i+ 1,4 

Ni+ 2,4 
A^i+ 3,4 


(/j+5 Zi+3)(Zi+3 /i+2) 

(/i+5 /j+2)(/i+4 2 ) 

_ (/j+3 ~ /i+2) 2 _ 

(Zi+5 /j+2)(/i+4 /*+ 2 ) 


(Z/+3 /j+l)(/i+4 /i+ 3 ) 
(/i+4 /i+l)(/i+4 /«+ 2 ) 


(24.69a) 

(24.69b) 

(24.69c) 


Now let us write up all the equations that determine the break points. 


where 


i = 0 

: C(t 3 ) 

= p 1 

= *o, 


i = 1 

: C(U) 

= a\P\ + P\P 2 + 7t-f*3 

= Xi, 


i = i 

: C(t i+3 ) 

= OL%P i + (3iP )+1 + 'JiP i+2 

= Xi, 

(24.70) 

i = n — 2 

: C(t n + i) 

= Pn -1 

C4 

1 

e 

* 

ii 



OLi — 

A = 

li = 


(/*+4 ti+ 3 ) 


(/i+4 /j+l)(/i+4 /«+ 2 ) 

(/*+5 /*+3)(/i+3 /t+ 2 ) (/*+3 Zi+l)(/i+4 /j+ 3 ) 

(/i+5 /j+2)(/i+4 /i+ 2 ) (# 1+4 /j+l)(/i+4 /j+ 2 ) 

(/j+3 - ti+2) 2 


(/*+5 /*+2)(/i+4 /j+ 2 ) 

All these n — 1 equations can be expressed in matrix form, as shown here: 


(24.71a) 

(24.71b) 

(24.71c) 


' 1 


' Pi ' 


1- 

O 

* 

1 _ 

at Pi 7i 





On—3 Pn— 3 7n—3 





1 


1 

73 

1 

1 _ 


3 

to 

1 _ 


(24.72) 


Looking at this matrix equation, we observe that it has a particularly appealing shape. Such shape is called 
tridiagonal, and it is particularly easy to solve a system of tridiagonal linear equations [Press et al., 1999b] 
and by doing so we can compute all the control points. 

However, we have omitted a small detail: how should the knot vector be constructed if we are only 
given the break points? A good approach to this problem is to use a heuristic, which is known as the chord 
length heuristic (see [Watt et al., 1992]). We will require that the following condition is always fulfilled: 


/i+4 ti+3 _ H-^i+1 ACj|| 

/i+5 ti +4 ||AC.j_|_2 


(24.73) 
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24.4 Cubic Curve Decomposition 

B-splines arc attractive due to two properties. They have local support and they guarantee continuity 
across adjacent curve segments. These properties arc not mutually present when having a composition of 
Bezier curve segments. Changing a control point on one curve segment means that one has to change the 
control points on all other curve segments to ensure continuity. However, a Bezier curve composition is 
far better to work with for rendering and evaluation. In our opinion it pays off in terms of performance 
to convert a B-spline to a composition of Bezier curve segments when one is finished with modeling the 
final shape of the spline. 

Nonuniform B-splines arc easily converted into a composition of corresponding Bezier curve segments 
[Hoschek et al., 1993] by performing an operation on them, which is called knot insertion. We have 
devoted this section to treat all the details about how one takes a nonuniform B-spline and ends up with a 
composition of cubic Bezier curve segments. 


24.4.1 The Cubic Bezier Curve and the B-Spline Connection 

The best way to see how a cubic nonuniform B-spline corresponds to a cubic Bezier curve is by direct 
computation. Assume that we have the nonuniform B-spline 

3 

C(u) = Y, PiNi , 4 , (24.74) 

i =0 

defined on the knot vector 

T= [ 0 , 0 , 0 , 0 , 1 , 1 , 1 , 1 ]' T . (24.75) 

We say that such a knot vector has no internal knots and that the multiplicity of all its knots is k = 4. We 
immediately see that only four basis functions arc involved. These arc: 


^0,4j-(Vi j 4,iV2 ) 4,iV3 ) 4. (24.76) 

Looking at Figure 24.6 it is fairly easy to evaluate these four basis functions. We can see that only those 
paths with leaves where the numerator t :j+ \ — tj arc nonzero contributes to the value of the A r -function. 
By direct substitution, we derive: 


No,4 = (1 — u ) 3 , 

(24.77a) 

iVi ,4 = 3 u (1 — u) 2 , 

(24.77b) 

iV 2,4 = 3It 2 (1 — u) , 

(24.77c) 

CO ’ 

S3 

II 

(24.77d) 
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Expansion of terms yield: 


iVo ,4 = — u 3 + 3 u 2 — 3u + 1, (24.78a) 

N 1>4 = 3 u 3 - 6 u 2 + 3 u, (24.78b) 

N 2A = ~3u 3 + 3u 2 , (24.78c) 

N 3i 4 = u 3 . (24.78d) 


Plugging our results into the original summation for the B-spline and using matrix notation we derive 


C{u) = u 3 , u 2 ,u, 1 


'-1 3 -3 l' 


1- 

O 

&H 

1_ 

3-630 


Pi 

-3 3 0 0 


Pi 

10 0 0 


-1 

CO 

Oh 
_1 


(24.79) 


But this is actually the matrix definition of a cubic Bezier curve. There is no doubt by now that there 
definitely is a connection between the spline and Bezier curve representations. However, there is a small 
complication: not all nonuniform B-splines have the particularly nice looking knot vector as we used to 
show the connection with Bezier curves. What should we do about a general looking B-spline? For now, 
we assume that every knot value in the general B-spline has multiplicity k (in the next section we will start 
on attacking the problem of twisting the spline into this form). The general cubic nonuniform B-spline 
can then be seen as a sequence of separate nonuniform B-splines, because at each unique knot value the 
first k — 1 derivative vanishes (see Section 24.6.1). Each sequence of this special looking spline looks 
almost like the one we used in our derivation above. The only difference is that the knot values arc not 
zeroes and ones. Instead, the knot vector looks like this: 


T — [/ii t j, t j, tj, , /j-)-l, /j-)-l, ] 


(24.80) 


The indices arc the ones from the original spline with multiplicity 1. We can easily get the zeroes by 
subtracting the left knot value from all the knot values. This is perfectly legal because we arc looking at a 
local segment of the spline. We end up with a knot vector like the following: 

T = [0,0,0,0,s,s,s,s] T , (24.81) 


where 

s = t i+ 1 - ti . (24.82) 

This knot vector almost looks like the one we want except for the s- value, which is not necessarily 1. If 
we repeat the above computation, we will derive at a cubic Bezier curve with similar parameter scaled 
by s. We can conclude that we still have the connection between cubic Bezier curves and nonuniform 
B-splines. 

Let us tty to get rid of this scaling problem. We introduce the terminology of global and local param¬ 
eters. The global parameter is denoted by U and it is the parameter that was used to define the original 
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nonuniform B-spline with (see (24.1)). We now use u for the local parameter of the f th curve segment 
B (u) of the original spline. We want 

B(u) = C{U). (24.83) 

If we compute u as follows: 

u = f(U), (24.84) 


where 


m = 


u-u 


ti -\-1 ti 

then our property is fulfilled. Another nice property is that u runs from 0 to 1. 


(24.85) 


24.4.2 Global and Local Parameter Conversion of Derivatives 

Given a local Bezier segment B L (u) of C(U) where we know that 

C(U) = Bi(u). 

We want to derive a conversion rule for the derivatives of the curve segments. 
By straightforward differentiation we get 

^C(C) = = B'(f(U))f'(U), 


where 


m = 


ti+i-u 

Putting it together we get the conversion rule for the first-order derivative we we’re looking for. 

C'(U) = -J—.B'mU)) = —^—B^u). 

^ 2+1 T'i H -\-1 £i 

If we continue our differentiation, we get 

-§pC(U) = ±B'(f(U))f(U) = B’lumauf + B'(f(U))f"(U ), 
and we can easily compute 


f"(U) = 


d ( 1 


' v ' dU\t i+1 -hJ 

Substituting back into our computation, we get the conversion rule we were looking for. 


C"(U) = 


B'ium = 


KU+I-UJ " \ti+i-ti 

By now the pattern should be obvious and we can conclude 


B'liu). 


(24.86) 


(24.87) 


(24.88) 


(24.89) 


(24.90) 


(24.91) 


(24.92) 


c(j) (U) = (r^—r) J B?\f(U)) = (-^—-yB\ j \u). 

\^2+l ti / \H+1 H/ 


(24.93) 
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24.4.3 Knot Insertion 


We arc now ready to tackle the task of knot insertion. Even though the resulting algorithm is quite simple 
and resembles the de Boor algorithm discussed in Section 24.7, we will need a number of preliminary 
definitions and theorems. 

Definition 24.3 (Divided Differences) 

Assume we have a function g(t) and a knot vector 



T — [• • • > tii ti+li ■ ■ ■ ) ti+ki ■ ■ ■ ] ) 

(24.94) 

where 

Vi . ti A ti- (_i < 

(24.95) 

We define the divided difference 
is given by 

on g recursively 1 in the following manner. 

The zero ’th divided difference 


> 

Si 

II 

to 

cS' + ' 

(24.96) 

and for k = 1 

A i+ i g(ti + 1) - A ig(ti) 

H-\-1 ^ i 

(24.97) 


The general k ’th divided difference 2 is defined by 


A i,...,i+k9 — 





(24.98) 


An important property of the divided difference, which we arc going to use, is that it is symmetric. This is 
easily shown. It follows trivially in the case of h = 0. Using operator notation and assuming the k — l’th 
divided difference is symmetric for all k > 2 , we notice that 


A _ ^i+l,...,v,...,w,...,i+k ^i,...,v,...,w,...,i+k— 1 

U+fc u 

_ ^i+l,...,w,...,v,...,i+k ^i,...,w,...,v,...,i+k— 1 

U+fc — ti 
^i,...,w,...,v,...,i+k- 

This clearly shows that the divided difference is symmetric. 

Theorem 24.3 (Leibniz’ Formula) 

If we have f(t ) = g{t)h{t) then Leibniz’ formula states 


(24.99a) 

(24.99b) 

(24.99c) 


i-\-k 

A = ^2 A i,-,r9^r,...,i+kh. (24.100) 

r=i 

'There are other ways to define the divided difference (see [Boor, 1978]). 

2 We have ignored the case ti = ■ ■ ■ = ti+k (see [Boor, 1978] for details). 
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The proof requires another definition of the divided difference The interested reader should refer to 
[Boor, 1978] for a proof. 

Lemma 24.4 (Divided Difference Relation) 

Given divided differences A; _ A ^...j+k-ijy and A yj+i,...,*+& the following holds: 

0 — (ti fi+fe) ^i,...,i+k T (fj i“i)^i,...,i+k— l J T (ti+k (24.101) 


Proof of Lemma 24.4: 

For now, we will try to show that this funny equation is in fact true. We’ll start out by using the definition 
of the divided differences on the funny (24.101): 


0 — (ti ^i+fc)Aj j ...j+k T (tj tf) 
T ifi+k — tj ) 


H+l,...,i+k—l,j 


— A. 


i,...,i+k— 1 




Aj+q.-.y+fc ...,j+fe_i 


(24.102) 


ti+k tj 


Cleaning up gives us 

0 A i t ... t i-\-k 


H+l,...,i+k—l,j 


— i.i — A 


•j,i+l,...,i+k— 1 


A 


i+l,...,i+k 


-A 


i,...,i+k~ 1 


ti ti+k 


ti+k ti 


By symmetry of the divided difference we have: 


(24.103) 


^i+l,...,i+k— lj Aj ) j_|_i ) 


(24.104) 


so our second term vanishes from our equation. Now looking at the third term we recognize the definition 
of the divided difference, that is, 


H,...,i+k — 


^z+l,...,i+fc A. 


i,...,i+k~ 1 


ti+k ti 


(24.105) 


So the first and third terms cancel each other out, and we immediately see that (24.101) is true. □ 


Definition 24.4 (The Ordinary B-Spline Basis Function) 

The unnormalized k’th order B-spline basis function M % also called the ordinary B-spline basis function, 

is defined recursively. This is very similar to (24.1). We have, for k > 1: 


and for k = 1: 


Mij~ 


U-tj 
ti+k ti 


Mi t k -1 + 


ti+k 11 
ti+k ti 


Mi+l,k-l, 


M. l1 = 


ti -\-1 ti 

0 


if 1 1 <u< t i+ i, 
otherwise. 


(24.106) 


(24.107) 


The ordinary B-spline basis function can be turned into the normalized B-spline basis function using: 
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Theorem 24.5 (Relation between Ordinary and Normalized B-Spline Bases) 

An ordinary B-Spline basis M may be converted into a normalized B-spline N using 

Ni,k = ( U+k ~ (24.108) 


This is all easily verified by straightforward computation and comparison with the definition of the nor¬ 
malized B-spline basis function (see (24.1)). 

We will now show how to obtain an Ordinary B-spline using divided differences. 


Theorem 24.6 (Ordinary B-Spline from Divided Differences) 

Given a known function 3 g, 


9(t ) 


0 ift < u, 

(t — «) fc_1 ift > u, 


(24.109) 


then the k’th divided difference corresponds to the ordinary k’th order B-spline basis function as 


M itk = A,.,.((t - «) fc_1 ) >u • (24.110) 

We have adopted the notation ((t — u) k ~ 3 ) >u to mean (t — w) fc_1 ift > u and 0 other-wise. We will now 
show this by induction 4 on k . 

Proof of Theorem 24.6: 

First we observe that for k = 1 we have 


Now if u < tj then 


M u = A m+ i ((f - m)°) >u 

_ A m {(t ~ U )°) >u ~ ((* - «)°) 


>u 


M u = 


If U < u < t l+ \ then 

And finally, if u > t l+ \ then 


M u = 


tiy 1 tj, 

= ((ti+i -m)°) > u ~ ((*i-«)°) >u 

(tj +1 - u)° - (tj - u)° _ 1-1 

1 ti ti-\. i t{ 

(ti+1 - U)° - 0 1-0 1 


= 0 . 


tj +1 tj tj +1 tj /*+1 tj 

0-0 


M u = 


tj +1 tj 


= 0 . 


(24.111a) 

(24.111b) 

(24.111c) 

(24.112) 

(24.113) 

(24.114) 


’Notice we have defined g differently than in [Boor, 1978, Boehm, 1980]. 

4 Note that if we use the definition of g in [Boor, 1978, Boehm, 1980], then the base case will always fail. 
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So we can conclude that (24.110) is true when k = 1. Now let us assume that (24.110) is true for k = k— 1 
and examine 

= (t — u) ^(t — u) k ~ 2 ^j . (24.115) 

We can think of this as the product of two functions, so let’s try to use Leibniz’ formula (24.100) on it. 

\,...,i+k ((/ - U ) k l ) >u = ~ ((t - u ) k ~ 2 ) >a 

+ A i)i+ i (t - u )((t - U ) k ~ 2 ) >u 
+ A i,...,i+2(t - u)A i+2 ,...,i+k ( (t - 


+ A i t ..., i+ k(t - u)A i+k (jt - u) k 2 ) 


Now, by straightforward computation we have 

A i(t - u) = (ti - u ) 

A u N (U+i -u)~ (ti - u ) 

Aij + i(t - u) =-----= 1 

H+l H 


(24.116) 


(24.117a) 


(24.117b) 


A u ^i+l,i +2 (t u) A ii+ i(t u) 11 

V U+2-U t i+2 -U 


(24.117c) 


A u) — 0. 

Using these results (24.116) reduces to 


(24.117d) 


A *.i+fc ((/ - u)*" 1 ) = (ti-u)Ai_ i+k ((t - u) k ~ 2 ) +A i+1 .„., i+k ((t - u) k ~ 2 ) . (24.118) 

V / >u \ / >u \ / >u 


By the definition of the k ’th divided difference we have 


H,...,i-\-k — 


H-\-l ,...,i-\-k t-±i,...,i-\-k—l 

ti+k H 


Multiplying by (ti — u ) gives 


( \ A (ti ^Z*) /A A \ 

(ti 'U / )^i,...,i-\-k — 77 ~T ^i,...,i+k— l) • 

V'i+k H) 


(24.119) 


(24.120) 
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Substituting this into (24.118) yields 

A ({t - u)^) = 

(t ‘ ~ “j - (a, +1 . i+l (ft - u)‘- 2 ) -A..**_,((»-„)*-*) ) 

(ti+fc — ti) \ \ / >u \ / > 22 / 

+ A i+1 ,... )i+fc ((t - n) fc - 2 ) . (24.121) 


Now, by induction, this means that we have 

M i,k = ^ ~ U ] t (M i+ i, fc _i - + M*!,*.!. (24.122) 

VW+/c W J 

Reainanging the terms, this equation can be rewritten to 

M t . k = , U ~*\ Mi jk _i + ^ +k ~_ U M i+l)k _ L (24.123) 

H+fc ^2 ^2+fc H 

But this is the definition of M, /,,, which means that we have proven the correspondence of M l k and the 
k 'th divided difference. □ 


Finally, we arc ready to make the knot insertion algorithm. Imagine that we insert a new knot value, 
t*, into the knot vector, T, where 

t r <t* < t r +i. (24.124) 

Then we get a new knot vector 

T = t r+k ,...] T . (24.125) 


Observe that 


U 


ti if 7 < r, 

< t* if i = r + 1 , 
U -1 if * > r + 1 . 


(24.126) 


Given a nonuniform B-spline, C(u), defined on T, we want to find a new nonuniform B-spline, C(u), 
defined on T, such that 


C(u) = C(u ) 


(24.127) 


for all values of u. If we look at the definition of the nonuniform B-spline, this means that we have to find 
all Pi s such that 

n n+1 

Y, N i, k p i = Y,N itk Pi’ ( 24 - 128 ) 

2=0 2=0 


© 


© 


© 


© 
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where the hat-notation means that the quantity is defined with respect to T. Recall that N t j. has support 
on the knot values t l+ k- This means that t* only affects the basis functions 

N r -k+i,k, ■ ■ ■ 5 N rt k. (24.129) 

Since C(u) and C{u) should have the same shape we can conclude that 


Ni = 



if i < r — k + 1, 
if i > r + 1 , 


from which it follows that we must have 


Pi 


Pi if / < r — k + 1, 
Pi-i if i > r + 1 . 


This leaves us with the problem of determining the P,s such that 


r r+1 

N i,kPi = ^2 N itk Pi. 

i =r -k -\-1 i=r-k -\-1 


(24.130) 


(24.131) 


(24.132) 


To solve this problem we will try to write N in terms of N. To accomplish this we will turn back to the 
funny (24.101). Let’s set tj = t* and rearrange the equation. Then we have 

(ti+k — ti)^i,...,i+k = (t — 1,* T ( ti+k ~ t )^*,i+l,...,i+fe- (24.133) 

Using (24.110) we get, for all values of i = r — k + 1 to r: 

0 ti+k ~ ti)Mi, k = (: t* - ti)M iik + (U+k ~ t*)M i+ i, k - (24.134) 


We’ll convert the M- -terms into iV-terms by using (24.108) 


Nik = 


t*-tj 
ti+k t't 


Ni^k + 


U+k -1* 
U+k+1 U+1 


-N 




(24.135) 


Now we want to change notation from old knot values into new knot values. We can do this by observing 
that for any value of i = r — k + 1 to r, we have: 


So we end up with 


ti ti and tj-i-fc ti-^k+i- 


t* - U - ti +k +i - t* ~ 

N+ k = - - l -Ni,k + f + + f N i+1>k . 


ti+k U 


U+k+1 U-\-1 


(24.136) 


(24.137) 
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We use this to rewrite the terms in the summation of C(u), that is, 


E N <* p ‘= E 


i=r-k-\-l 


i=r—k +1 


t* - u 


ti+k+l t 


ti+k ti ^i+k+1 ti+1 


^i+l,k ) Pi- 


(24.138) 


Expanding the summation gives 


y! n i,kPi 


i=r—k +1 


' | t r+2 ~t* \ 

? 2 ™r-k-\-l,k '? 2 lvr— fc+2,/c I r—k+l 

Tr+1 — ^r—k+l t r -\-2 — ^r—k+2 / 

+ ( /"V" tir-M-V. + i — J *' P ,- l+ 2 

Vri+2 0—fc+2 ®r+3 ri—fc+3 / 


+ ( y y jv rifc + u+k+i y 

\tr-\-k ^r+fc+1 ^r+1 


-^r+l,/c ) -^V* 


(24.139) 


With the knowledge that t* = £ r + 1, we immediately see that some of the fractions in the summation 
become 1. This reduction results in 


i=r-k -\-1 


y Ni'k.Pi = ( N r ~k+I,k + j — r+2 f - N r _ k+2 , k ) p r -k +1 

' ri+2 — U-k+2 ' 


, ^ U-k+2 2 t r + 3 ** i\t \ p 

+ I J-J- iv r-fc+2,fc + J-J- lsl r-k+3,k I ^r-k+2 

Vri+2 — 0—fc-)_2 ri+3 — tr—k+3 / 


{ t X - t A \ 

+ ( a- j-Nrfi + (Vr+l,fc j P r . 


(24.140) 


We then collect terms of equal iV, to get 


y Ni.kP’i. = N r -k+i,kP 


r—k+1 


i=r—k +1 


I 7\V ^r+2 t* | t r —k+2 p 

+ - /v r-fc+2,fc I J-J- *r-fc+l + J-J- ^r-k+2 

\^t+2 tp—k+2 ^r+2 t r —k-\-2 


, ]\7 M r + fc p | t r p 

+ A V,fe I - -r-Pr -1 + -~ 

\ ‘'r+fc 

+ ^ r +l,kP r- 


(24.141) 



“book” — 2005/9/30 — 1^:44 — page 755 — #767 


24.4 PHYSICS-BASED ANIMATION 


Observe that we now have a summation over N t just like we wanted for (24.132). All we need now is to 
make the formulas look nicer. We do this by first expressing the fractions in terms of T and not T. 


yi N^Pi = N r -k+i,k.p 


r—k+l 


i=r-k -\-1 


~ ( f r +1 — t* t* — t r -h+2 

+ N r _k + 2,k , + \ - P r—k+l + 7- +^P r -k+2 

\t r -\-l t r -k-\-2 W+l ^r—k+2 


* (t r +k -i — t* t* — t r 

+ Nr,k ; +k Pr-1 + 7-f -P, 

X^r+k—l ^r ^r+k—l 

+ N, r +l,fcP r- 


Finally, we will introduce the notation 


from which we observe that 


t*-U 
ti+k— 1 — li 


(1 - a.i) = 


tj+k-i ~ t* 
H+k—l — H 


(24.142) 


(24.143) 


(24.144) 


With the new notation at hand, we have 


yi Ni^Pi ~ N, 


r— &+!,&-* r—k +1 


i=r—k+l 


+ N r —k+2,k ((1 — k+ 2) Pr—k+1 + fc+2-f*r—fc+2) 

+ ((1 — a r) P 1 — 1 + a r P' r ) 

"F -^r+l,kPr ■ 

From this we conclude that the solution to (24.132) is 

Pj if i = r — k + 1, 

Pi = < (1 - a*) Pj_i + aiPi if r - k + 2 < * < r, 

Pi-i if * = r + 1. 

Putting it all together, we can now state the knot insertion algorithm. We have 


(24.145) 


(24.146) 


C(u) = J2 Ni, k Pu 


(24.147) 
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algorithm lookupSegment(s) 
int i=0; 
do { 
i + +; 

}while(ArcLengthTable[i ] <s) ; 
i—; 

return i; 
end algorithm 


Figure 24.7: Pseudocode for linear search of segment corresponding to a given arc length value. 


with 


Pi = (1 - Pi-i + otiPi , 


(24.148) 


where 


'1 

t*—U 
ti+fc — 1 0 

0 


if i < r — k + 1, 
ifr — k + 2 < i < r, 

if i > r + 1. 


(24.149) 


If we compare the algorithm with the de Boor algorithm (see Section 24.7), then we will discover that the 
newly computed control points correspond to the de Boor points P) . P l r \,..., P l r _ k+2 . 


24.5 Accumulated Arc Length Table 

In the previous section we explained how the spline could be decomposed into a sequence of Bezier 
curves. Sometimes we arc interested in quickly finding the Bezier curve segment that corresponds to a 
given arc length value along the spline C{u ); for instance, if we do spline driven animation. 

Stated mathematically, we arc given an arc length value s and we arc now looking for the Bezier curve 
segment B L , such that 

S{U) < s < S{t i+ 1 ). (24.150) 

One way to quickly determine the curve segment is by using an accumulated arc length table. After 
having found the decomposition of the spline into Bezier curve segments. Bo,, B n - 1 , this table is 
easily constructed by computing the total arc length of each Bezier curve segment S b, and letting entry i 
store the value Sb,■ We can now perform a linear search through the table to find the segment we 

arc looking for, or even better we can do a binary search. In Figure 24.7 pseudocode for a linear search 
of a segment is shown. This sort of strategy is also known under the name accumulated chord length and 
it comes in different flavors, but the basic idea still remains the same. For more details about the other 
flavors see [Watt et al., 1992]. 
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24.6 The Regular Cubic Nonuniform B-Spline 

In this section, we will look at a subclass of the B-splines, which is known as regular B-splines. A regular 
B-spline is defined by having nonzero first-order derivatives everywhere. Regular B-splines are useful for 
describing a physical meaningful motion. That is the center of mass trajectory of a real-world object (see, 
for instance, [Erleben et al., 2003a]). 

In this section, we will formulate different conditions on how one should define a B-spline such that 
it would be a regular B-spline. We will only consider cubic B-splines. 


24.6.1 First-Order Derivatives at Knot Values 

Let us derive an equation for the derivative at the knot value ti 

C'{ti) = N'_ 3A Pi_ 3 + N'_ 2A Pi_ 2 + N'_ 1A Pi_ i + N' A Pi 


(24.151a) 




i-3 


Ni- 3,3 “ 


+ 

+ 

+ 


ti+l ti —2 
3 

ti+2 ti—1 

3 


ti+l ti —2 

Ni- 2,3 - 


- Ni _5 


i—2,3 I -Pi-3 


JVi-1,3 - 


ti+2 ti—1 

3 


-Ns—-i 


i—1,3 I Pi-2 


tj+3 t 


■iVi.3 - 


ti+3 ti 

3 


/Vi , 3 I Pi- 


i—1 


ti+4 tj+i 


-iVi+1,3 I Pi- 


(24.151b) 


Recall that Ni- 3A = 0 and Vj +13 = 0 since we arc looking at the f th knot value of a third-order B-spline. 
Using this knowledge we can rewrite our equation into the following sum of differences 


C'(U ) = 


3/Vj— 2,3 

ti+l — ti —2 

3/Vj_i j3 


+ 

+ 


ti+2 tj_i 
3 /Vj 3 


(Pi.2 - P,_ 3 ) 
(Pi-r - Pi- 2 ) 


P - Pi- 


i-l; 


ti+3 t 

We’ll evaluate the basis functions by using de Cox’s definition(24.1); we start by looking at A+ 3 . 


(24.152) 


© 


Nt 


i, 3 


T^rNi? 
ti+2 ~ti 


H+3~U AT 

U +3 - U+1 Ni+h2 


© 

u—ti 

■Ni,! 

ti -\-1 

\ 

U+2—U 

Ni+1,1 

ti+2 — ti+l 

© 

U — ti + l 

Ni+1,1 

ti+2—ti+l 

\ 

U+3-U 

Ni+ 2,1 

ti+3~ti+2 
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© 
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Recall that only N^i is nonzero for u = ti. From this we see that N l ^ is always zero. Now let us look at 

Ni- 1,3 



u—ti— 1 7\r 

U+i-U-i^i- 1.2 

/ 

U — ti— 1 A7 




/ 


\ 

ti+l-U jy 

U+i-U 1 *4 

\ 

U+2-u n 

U+2—U 

/ 

AT- 

fi+i-ti ’- 1 



\ 

9+2—Y- , , , 



ti+2—ti+1 *+14 


Setting u = t-i, we see that 

i\ 

Finally, we only have to evaluate A7_2.3 


Ni- 1,3 = 


U—ti -2 


H H— 1 
ti +1 1 




/Vj-2,3 


* b 2 


Again we have u = ti, and from this we conclude that 




.AT. 

ti—ti—l ^ 1 




tj+l-U jy 

*4 


iVi-2,3 = /‘ +1 /' > 0. 

^ 2+1 1 


(24.153) 


(24.154) 


Substituting the basis functions we just found into our equation for the first-order derivative, we obtain 

, 3 ( Z?4-1 ti , . _ __ , ti ti —1 , __ __ , \ _ 


c'(u) = t ( , tl+1 ‘‘ - Pi- 3 ) + A ^r 1 (P,-, - p,_ 2 ; 

W+l 1 \^Z+1 2 W+2 ^2—1 


(24.155) 


If we wanted to make C\ti) equal to zero how could we then go about this? From our equation, we can 
by straightforward inspection derive the following complete list of conditions, which will make C (t t ) 
equal to zero. 


-Pi-1 = Pi-2 = Pi— 3, 


• Pi-i = Pi -2 and t i+ 1 = tu 

• Pi-2 = Pi-3 and ti = ti- 1 , 
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/*+1 — H — tj—1> 


• The final condition is a bit more difficult to derive 


0 = 


ti +1 ti 

ti +1 — /j—2 


(-Pi—2 - Pi- 3 ) + 


^2 ^2—1 
^ 2+2 ^ 2—1 


(-Pi-1 “ Pi-1 


From which we get 


(tj+1 /j) (/j+2 tj-l) ,p p 

(U +1 - 2) (ti - ti_ 1) l_3 


(Pi-1 ~ Pi-2 ) • 


(24.156) 


(24.157) 


This tells us that the three succeeding control points Pi- 3 , Pi- 2 , and P \ all have to lie on the 
same line and the magnitude of ( Pi -2 — Pi- 3 ) must be a scalar multiple of the magnitude of 

(Pi-^-Pi-2). 

With all this theory at our disposal we surely know how to avoid that C {t r ) ever becomes zero for any 
knot value. C'(ti) is always nonzero for any knot value ti if the following three criteria is fulfilled. 


1. No knot value has multiplicity greater than one. 

2. All control points arc unique. 

3. We never have three (or more) succeeding control points lying on the same line. 


Note that if we put this theory to practical use then it is slightly more restrictive than it needs to be; that 
is, it does exclude some splines having nonzero first-order derivatives. 


24.6.2 First-Order Derivatives between Knot Values 

By now we have complete knowledge about how we can control C'(u) at any knot value, ti, but what 
happens between the knot values? That is, how do we ensure that 

C'(u)£ 0 (24.158) 

for an u -value in between two knot values? That is, 


ti K. U <. ti -(_!• 

Let us start writing up the equation for C'(u) again 

c'(ti) = - m - 2 f (p t -2 - Pi- S 


ti+l ti —2 

3/Vj_i j3 


+ 

+ 


U -(-2 “ ti -1 

3/Vi 3 


(Pi-1 ~ Pi-2 ) 


ti-\- 3 ti 


Pi ~ Pi- 


i—lj 


(24.159) 


(24.160) 
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And like previously, we will now evaluate the basis functions. 


Ni- 2,3 
Ni- 1,3 


Ni 


i, 3 


(tj +1 ~ uf 

(^i+1 ^ 2 ) (^z+1 l) 

(tt + l-u) (u - ti) (ti+2 ~ 

(^ 2+1 ^ 2 ) (^ 2+1 ^ 2 — l ) (^ 2+1 ^ 2 ) (^ 2+2 ^ 2 ) 

(t* ~ tj) 2 
(fj+l /*) (fj+2 


From our knowledge of the knot vector 


t%— 1 ti U <C. / /—2 ■ 


(24.161a) 

(24.161b) 

(24.161c) 


(24.162) 


We can conclude that A/^3, A r ( _i,s, and W-2,3 are always positive and never zero. So our equation for the 
first-order derivative reduces to the equation 


C'{ti) = a ( Pi -2 - Pi- 3 ) 

+ b (Pi—l — Pi- 2 ) 
+ C (Pi ~ Pi- 1) 


where 


a = 


5 = 


c = 


3 (fj+i - u)' 


> 0 , 


(fj+l ti— 2 ) (fj +1 fi) (/j+l fj—l) 

3 / (4fi - tt) (tt - fj-i) (rr - tj) (t i+ 2 - u ) 


(^2+2 ti— l) \ (^2+1 ti) (ti-\-l ti— 1) (^2+1 ^2) (^2+2 ti) 

3 (u - U f > Q 


> 0 , 


(/»+3 /j) (Zj+1 2j) (fi+2 /f) 

All we have left to do is to solve the linear - system 

aAi -2 + bAi-i + cAi = 0 , 


(24.163) 

(24.164a) 

(24.164b) 

(24.164c) 

(24.165) 


where A, = P, — P \ and since we already have decided that none of the control points coincide we 
know that all A, are nonzero. In other words, we can only find a solution if the A , s are linear dependent. 
In three dimensions and higher it is quite easy to come up with a condition, which always ensures that 
C'(u ) is nonzero. Simply make sure that no four succeeding control points lie in the same plane. However, 
in two or one dimensions it is unavoidable to have linear dependent As, and the problem still persists in 
3D space (or higher) if one wants the spline (or a subpart of it) to lie in a plane. So let us tty to handle the 
lineal - dependence. 
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Two Linear Dependent As Recall that each a, b, and c is a function of u; actually they arc all second- 
order polynomials. So assuming we have two linear independent As at our disposal, then we have three 
possibilities. 


sAi + iAj_i = Aj_ 2 , (24.166a) 

rAj_2 + wAi = Aj_i, (24.166b) 

xAi_i + yAj_ 2 = Aj, (24.166c) 

for some nonzero scalar values s, t, v, w, x, and y. Now let us look closely at the first possibility. 

a(u) (sAi + fAj_i) +6(u)Aj_i + c(«)A; = 0, (24.167a) 


{a(u)s + c(u))Ai + ( a(u)t + b(u)) Aj_i = 0. 

Since A z and A,_i are linear independent and different from zero, this can only occur if 

(24.167b) 

a(u)s + c(u) 

1=0, 

(24.168a) 

a(u)t + b{u) 

1 = 0. 

(24.168b) 

Similarly, the other two possibilities give rise to 

b{u)v + a{u 

) = o, 

(24.169a) 

b(u)w + c(u 

) = o, 

(24.169b) 

and 

c(u)x + b(u] 

) = o, 

(24.170a) 

c{u)y + a(u] 

) =0. 

(24.170b) 


In other words, our problem has been reduced to determine if two parabola intersect and if they do then to 
determine the ?/-val ucs where they intersect. 

Three Linear Dependent A’s Following our derivation from the previous section, we have 

a(u)xAi + b(u)yAi + c(u)zAi = 0 (24.171) 

for some nonzero scalar values x, y, and z (all nonzero). All this boils down to a single equation. 

a(u)x + b(u)y + c(u)z = 0. (24.172) 

By now we arc tempted to repeat our earlier idea and look at the linear - dependency of the three polynomials 
a(u), b(u), and c(u). If they all are linear independent then no solutions exist and we are guaranteed that 
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C' (u) is always nonzero. If, on the other hand, we have some polynomials that arc linear - dependent, then 
we are once again left with determining if and where two parabola intersect. To see this, imagine we have 


then (24.172) becomes 


a(u ) = vb(u) + wc(u ), 


(24.173) 

a(u)x + b{u)y + c(u)z 

= 0, 

(24.174a) 




(■ vb(u ) + wc{u))x + b(u)y + c(u)z 

= o, 

(24.174b) 




(■vx + y)b(u ) + (wx + z)c(u) 

= 0, 

(24.174c) 




(yx + y) 

, , Mu) + c(u) 

(wx + Z) 

= 0 

(24.174d) 


just like we have seen previously. 

We can conclude that we can only have C'{u ) = 0 between /, and t l+ \ when two second-order 
polynomials in u have an intersection in the interval / i+ [ . In other words, the first derivative can only 
vanish at isolated values of u, that is, C'(u ) = 0 between knot values can only occur at cusps. 


24.7 The de Boor Algorithm 

In this section, we will state the de Boor algorithm and prove it. 

Theorem 24.7 (The de Boor algorithm) 

The de Boor algorithm can be stated as follows: 


n+j 


C(u) = ^2 N i,k-jP\ for 0 < j < k - 1, 

%—0 

(24.175) 

where 

Pl = {\- a^Pjfl + a iyj Pl~ l for j > 0, 

(24.176) 

and 

a,i,j = , and = Pi. 

T-i+k—j H 

(24.177) 

The points P] are called the de Boor points. 


Proof of Theorem 24.7: 

Now we will prove the de Boor algorithm. First, let us write up the definition of a nonuniform B-spline. 

n 

C{u) = Y J N i , k Pi. 

(24.178) 


i =0 
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If we now apply de Cox’s definition for N t j. then we get 


c(u)= x; 


U — tj 


( ._q H+fc— 1 tj 


Ni.k~\Pi + 


/i+fc 


i^O tj+1 


N, 




-i 


(24.179) 


This looks a bit nasty, but by applying a little mathematical trick of shifting the index of the second term 
by * = i — 1, we get 


C(u) = 


V 


u — tj 


n+l 


^_q ^i+&—1 ^2 


Ni,k-iPi + XZ 


ti+k— 1 tt 


ti 


_ ^ ti+fc—1 


Nj b—lPi—1 


(24.180) 


Finally, if we define P_i = 0 and P„+i = 0, then we are allowed to rewrite the two summations into a 
single summation 


P i -L j-j / 

C(») = £^ 

i=0 


/j) T Pi— 1 (/j+fe—1 w) 

ti+k— 1 ti 


Ni,k- 1- 


(24.181) 


Again, we apply a little mathematical trick of adding and subtracting tj from the second term in the 
denominator. That is, 


n+l 


c(u)= x: 


i=0 

n+l 


Pi {it ti) + Pi— i 1 ^ “1“ ll) 

1 H 


Ni,k -1 


E Pj (n — ti) + Pi_i — tj — (ti — f *)) 

-—-T-^.*-1 


i=0 

n+l 




X^(Pt++^-i(i-«t))^ 


i.k— I; 


(24.182) 


i=0 


where aj is defined as in the definition of the de Boor algorithm. The final equation is derived by intro¬ 
ducing the notation 


P\ = (1 ~ a}) Pi -1 + a]P t . 


(24.183) 


So we get 


n+l 

C(u) = Y J P}Ni, k - 1 . 


i =0 


(24.184) 


From all this we can conclude that we have proven the de Boor algorithm in the case where j = 1. It is 
not hai'd to see that if we apply the same index shifting and definitions recursively we will end up with the 
de Boor algorithm in the general j’th case. □ 
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24.8 Repeated Knot Insertion 

If our spline, C(U ), does not have any sequences of identical control points or knot values, then the local 
Bezier segment Bi(u ) corresponding to the segment i, to t l+ \ will have geometry vectors, Gq, G \, Go 
and G\ 3 , which are unique and distinct. This postulate follows rather easily from direct computation. 

Let us assume that we have a multiplicity of k = 4 for all knot values to the left of t r and that all knot 
values to the right of t r and t r itself have a multiplicity of 1. Now imagine we have 

t r <t*<t r+ 1 . (24.185) 


This means that for a cubic spline (i.e., k = 4) we have N r _ 3 , 4 ,..., N r> 4 / 0. From this we also know 
that it is the control points P r _ 3 , P r - 2 - P r - i> an d which have to be recomputed in the new spline. 
The new de Boor points arc generated as in the table below 


where 


\ 

(1 - a r - 2 ) 




\ 


P° r _ 2 -> 

a r -2 

-> Pj_ 2 


\ 

(1 - a r _i) 

\ 


Pr -1 

d r —1 

- Pi -1 


\ 

(1 - dr) 

\ 


pO 

j. r 

d r 

—P 1 


a r -2 = 

([tr t r — 2 ) 

(t r _l_l — t r —2 

) >0 ’ 

(24.186a) 

dr— 1 = 

([tp t r — l) 

(^r +2 — t r —l 

) >0 ' 

(24.186b) 

Qjy - 

(t r t r ) 
(tr +3 — /r) 

= 0 . 

(24.186c) 


Now we can construct the new control points P 


Po 

Pi 

P 2 ■ 

1 

CO 

pl 

r—2 

pl 

* r—1 

P 1 

x r 

Pr 

Pr+1 

Pn 

Po 

Pi 

P 2 . 

1 

CO 

Pr-2 

Pl -1 

Pr-1 

Pr 

Pr+1 

Pn 

Po 

Pi 

P 2 . 

• • Pr-3 

Pr-2 

Pr- 1 

Pr 

Pr+1 

Pr+2 

■ ■ P n+1 
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and the new knot vector T. 


to 

h ■ 

. . t r 

t* 

/r+1 • ■ 

■ • t"ri+k 

t-o 

h ■ 

t r 

t r 

/r+1 • ■ 

■ • t"n+k 

t-o 

h ■ 

. . t r 

/■r+1 

Zr+2 

tn+k+l 


We now have multiplicity of 2 for the knot value t r . Let’s try to insert t* = t r once more. This time we 
will discover that 

Z r +1 < t* < t r + 2 , (24.187) 

from which we conclude that it is the control points P r - 2 , P r - 1 > Pn and P r +i, which have to be 
recomputed in the new spline. 


where 


r—2 


(1 - a r - 1 ) 


r— 1 


&r—l 


r— 1 


(1 - a r ) 


(1 Q-r+l) 


r+1 


®r+1 


r+1 


d r — 1 — 


{tr tj — t) t r — 


d r — 


(/r+2 t r — t) /r+1 /j — 1 

{tr tr ) t r /' 


- > 0 , 


Or+1 — 


(/r+3 — /r) ^r+2 t r 

{tr if+l) /r /' 


(/ r _)_4 — /r+2 t r 


= 0 , 

= 0 . 


(24.188a) 

(24.188b) 

(24.188c) 


With this information we can write up the new control points P 
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Po ■ 

, . p r - 3 

P 1—2 

Pr -1 

P 1 

J. j. 

p 1 

r+l 

P r +1 

• • P n +1 

Po ■■ 

, . P1 —3 

P^r —2 

Pj —1 

*r—l 


P r . 

Pn 

Po ■■ 

, . P1 —3 

Pi -2 

D 1 

r—1 

Pi -1 

Pr— 1 

P r . 

Pn 

Po .. 

, . P t —3 

Pr -2 

Pr -1 

P r 

-Pr+1 

Pr +2 ■ 

■ ■ Pn +2 


and the new knot vector T 


to 

tl . 

.. t r 

/r+l 

/* 

tr+2 

■ • Zn+fc+1 

to 

t-l . 

t r 

t r 

t r 

/r+l 

■ • 

to 

tl . 

.. t r 

/r+l 

tr-\-2 

/r+3 

■ • /n+fc+2 


We now have a multiplicity of 3 for the knot value t r . Let’s see what happens upon the last knot insertion. 
First, we see that we have. 

/r+2 <t*< t r+ 3 , (24.189) 


from which we know that we have to recompute the following control points. 


r— 1 


where 


(1 - a r ) 


r+l 


r+2 


(1 Q-r+1) 

®r+l 

(1 — a r+ 2 ) 

O-r+2 


r+l 


r+2 


Cty' - 

Clr+1 = 

Clr+2 = 


(tr tr) tr tr 


(t-r+3 t r ) /r+l t r 
(t'j* /j*_(_]_) t'r t^ 


(tr+4 /r+l) /r+2 # 

(/r /r+ 2 ) /r /r 


= 0 , 

= 0 , 


( t-r+5 /r+ 2 ) /r+3 t r 


= 0 . 


(24.190a) 

(24.190b) 

(24.190c) 


Finally, we write up the new control points p. 
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Po 

■ ■ Pr- 3 

Pi—2 

P r —1 

P' 

Pr+l 

p 1 

-*r +2 

Pr +2 • 

• • P n+2 

Po . 

• • Pi—3 

P ©2 

P r _l 

Pr —1 

Pr 

Pr+l 

Pr 

• P„ 

Po 

• • Pr- 3 

p 1 

r r-2 

P 1 

r r-l 

P 1 

r r-l 

P 1 

* r— 1 

P 1 — 1 

Pr 

• P„ 

Po ■ 

Pr- 3 

Pr-2 

Pi—1 

Pr 

Pr+l 

Pr +2 

Pr+3 

Pn+3 


And the new knot vector r. 


/o 

. . t r 

tr+ 1 

tr+2 

t* 

P+3 

■ • Pi+fc+2 

to 

t r 

t r 

t r 

t r 

^r+1 • ■ 

■ • 

TO • 

. . T r 

Tr+1 

A+2 

Tr+3 

Tr+4 

Tn+fc+3 


Now let us examine our results a little closer. We have now discovered that only three new control points 

arc actually computed: P l r _ 2 , P r _i- and P , 1 _ |, which arc all distinct. 

- 1 9 

Notice that P rl = P~ ,. This means the new control points actually can be computed directly by 
using the de Boor algorithm (see Chapter 24.7) on P r _ 3 , P r _ 2 , and P r —i- 


pO 

-*i—3 

\ 



pO 

-*i—2 

\ 

P 1 

-*7—2 

\ 

pO 

-*i—1 

-> 

pi 

* r— 1 

1 

M fr. 

T 
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Software: OpenTissue 


OpenTissue is an open-source library for physics-based animation developed mainly by the Computer 
Graphics Group at the Department of Computer Science, University of Copenhagen. 

25.1 Background History of OpenTissue 

The history of OpenTissue goes back to November 2001, where a small group of people: K. Erleben, H. 
Dohlmann, J. Spoiling, and K. Henriksen started to collect a toolbox of code-pieces of their own work in 
order to ease project collaboration and teaching efforts. We, as well as students, often reinvent the wheel 
during project work, limiting the time set aside for in-depth understanding and experimenting with specific 
simulation methods. Around August 2003, OpenTissue had proven to be a valuable tool for students and 
researchers at the Department, and it was released under the terms of the GNU Lesser General Public 
License. Today, OpenTissue works as a foundation for research and student projects in physics-based 
animation at the Department of Computer Science, University of Copenhagen and it is used as a research 
platform at several universities. 

Due to the experimental nature of OpenTissue, it contains a wide variety of algorithms and data struc¬ 
tures. Some code-pieces are merely wrappers of third-party software ensuring that the programmers can 
use these tools within the common framework of OpenTissue. Other code-pieces consist of tools extracted 
from our own research projects in soft tissue simulation and medical image data acquisition. An increasing 
number of students arc beginning to contribute their own code and arc helping greatly in maintaining and 
improving OpenTissue. The list of tools in OpenTissue is constantly growing, but a summary of methods 
and data structures at the time of writing includes: 

• Rectilinear - 3D Grid Data Structures 

• Twofold Mesh Data Structure 

• Tetra4 Mesh Data Structure 

• Chan-Vese Segmentation Tools 

• Signed Distance Map Computations 

• OpenGL-Based Voxelizer 

• Quasi Static Stress-Strain Simulation (LEM) 

• Relaxation-Based Particle System Simulation (support for surface meshes, solids, cloth meshes, 
pressure soft models, self-intersection, and much more) 
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• CJK, SAT, VClip and other algorithms for Collision Detection 

• Mesh Plane Clipper and Patcher 

• QHull Convex Hull Wrapper 

• Script files for Maya and 3DMax for generating Mesh data tiles 

• Generic Bounding Volume Hierarchies with Custom Bottom-Up Construction, Single and Tandem 
Traversals, etc. 

• Multibody Dynamics Velocity-Based Complementarity Formulation 

• First-Order World Rigid Body Simulation 

• Volume Visualization Using 3D Texture-Based View Aligned Slabbing with 12-bit Preintegration 

• and much more 


25.2 Obtaining OpenTissue 

The newest version of OpenTissue, demonstrations, and download and installation instructions, etc. can 
be found at: 

www.opentissue.org 

The following is the download and installation procedure as of the time of writing: 

The development of the OpenTissue software is revisioned using Subversion ( SVN ). To download any 
version of OpenTissue, you must have Subversion installed on your system. Subversion can be obtained 
from, subversion. tigris . org. Gentoo Linux users may write, emerge -av subversion, as 
root. New users will most likely want to read the Subversion book at svnbook. org. There arc also 
gui-interfaces, TortoiseSVN, tortoisesvn. tigris, for Windows® and RapidSVN, rapidsvn. 
tigris.org, for Linux. 

For anonymous checkout through http use: 

svn co http://image.diku.dk/svn/OpenTissue/trunk OpenTissue 

You may also browse with a Web browser in the repository using the url: image .diku .dk/svn/ 
OpenTissue. When browsing, you will notice three top-level directories. Normally, you will be inter¬ 
ested in the most resent version in trunk/. 

We have collected a small subset of DataTissue, containing data for the demos only. For anonymous 
checkout of the small subset through http use: 

svn co http://image.diku.dk/svn/DataTissue/trunk/demos 

You may also browse with a Web browser in the repository using the url: image .diku .dk/svn/ 
OpenTissue. See the different READMEs in OpenTissue for details about data demo dependencies. 
The full DataTissue (more than 1.6 GB) is available from 
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svn co http://image.diku.dk/svn/DataTissue/trunk/ 

To set up the OpenTissue for development, make sure you have the prerequisites in place before trying 
to compile any of the code. These consist of: 

1. Fetching OpenTissue/trunk. 

2. Fetching DataTissue/trunk/demos. 

3. Fetching boost_l_32_0 from www. boost. org. 

4. Fetching bindings from www. opentissue . org. 

5. Set the environment variables OPENTISSUE, DATATISSUE, and BOOST. 

6. On Linux, have a link from /usr/local/include/boost to boost_l_32_0/boost. 

See the different READMEs in OpenTissue for details. 

25.3 Using OpenTissue 

OpenTissue is mainly aimed for the Windows platform, although a high-priority in its design has always 
been to use only strict standard C++ constructs. In practice, this means that all code is required also to be 
compilable under Linux with a recent g++ version. 

OpenTissue is installed simply by copying it onto a selected location on the hard drive of your own 
choice. Then an environment variable must be defined named OPENTISSUE that points to the topmost 
OpenTissue folder of your chosen location. 

25.3.1 Developer Information 

The source code of OpenTissue is located in the folder: 

$(OPENTISSUE)/OpenTissue 

Microsoft Visual Studio C+ + .NET 2003 ® ( MSVC ) solution file and project file arc located at: 

$(OPENTISSUE)/ 

Demo applications arc in: 

$(OPENTISSUE)/demos/opengl/ 

Script files for Maya® and 3DMax® arc located in: 

$(OPENTISSUE)/scripts/ 

B—, , ,—© 
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Various bat-files and MSVC project files to help setting up external dependencies have been built into: 


$(OPENTISSUE)/dependencies 


When building OpenTissue with MSVC on Windows 2000® or Windows XP® the following folder struc¬ 
ture will be generated: 


$(OpenTissue)/Release 
$(OpenTissue)/Debug 
$(OpenTissue)/lib/windows 


The first two folders arc used by the compiler to hold object files and temporaries, while the last folder 
contains the actual lib-file. Currently, the default settings of OpenTissue arc to only build a lib-file for 
static-linking. 

All demo applications have been set up to statically link with OpenTissue. Their binaries and tempo¬ 
raries arc generated in local Debug and Release folders in every demo application. 


25.3.1.1 File Structure of Data Structures 

The subfolder structure is rather flat and not very deep. Each top-level subfolder usually contains a single 
data structure, and the next level of subfolders contains I/O-routines and algorithms for the data structure: 


$(OPENTISSUE)/OpenTissue/bitmap 
$(OPENTISSUE)/OpenTissue/bvh 
$(OPENTISSUE)/OpenTissue/geometry 
$(OPENTISSUE)/OpenTissue/math 
$(OPENTISSUE)/OpenTissue/map 
$ (OPENTISSUE) /OpenTissue/mesh 
$(OPENTISSUE)/OpenTissue/spline 
$(OPENTISSUE)/OpenTissue/tetra4 


2D Bitmap data structure 

Generic Bounding volume hierarchy data structure 

Geometric Primitives 

Vector, Matrices, Quaternions, CoordSys 

3D rectilinear grid data structure 

Twofold mesh data structure 

Nonuniform B-Splines 

Tetrahedral Mesh data structure 


25.3.1.2 File Structure of Utilities 

OpenTissue also contains various utilities for different tasks, such as visualization, interaction, and wrap¬ 
pers for third-party software. Each utility is contained in a subfolder of its own: 


$(OPENTISSUE)/OpenTissue/mc 
$(OPENTISSUE)/OpenTissue/RenderTexture 
$(OPENTISSUE)/OpenTissue/trackball 
$(OPENTISSUE)/OpenTissue/utility 
$(OPENTISSUE)/OpenTissue/xml 
$(OPENTISSUE)/OpenTissue/volvis 


Isosurface extractor 
RenderTexture Wrapper 
Virtual Trackball 
Various utilities 
PUG XML wrapper 
Direct Volume Visualization 
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25.3.1.3 File Structure of Simulation Methods 

All simulation methods using dynamics have been collected in a single top-level subfolder: 


$(OPENTISSUE)/OpenTissue/dynamics 


Algorithms based on kinematics arc intended to be placed in another single top-level subfolder named: 


$(OPENTISSUE)/OpenTissue/kinematics 


Beneath the simulation method subfolder, another folder structure is used, indicating a classification of 
the simulation method. For instance, particle systems arc placed in a particle system subfolder, multibody 
dynamics in a multibody folder and so on: 


$ (OPENTISSUE) /OpenTissue/dynamics/cfd Computational Fluid Dynamics 

$ (OPENTISSUE) /OpenTissue/dynamics/multibody Multibody Dynamics 

$ (OPENTISSUE) /OpenTissue/dynamics/particleSystem Particle System 

$ (OPENTISSUE) /OpenTissue/dynamics/deformation/terzopoulos/ Elastic Deformable Models 


25.3.2 Dependencies _ 

OpenTissue has dependencies on a great many deal of third-party software. These third-party software 
libraries arc distributed as binaries with OpenTissue or incorporated directly in the source code of Open- 
Tissue, except for Boost, which you must download and install yourself. The list of third-party software 
includes: 

• Boostwww.boost.org 

• RenderTexture www.markmark.net/misc/rendertexture.html 

• Atlas math-atlas . sourcef orge . net 

• PUG XML www.codeproject.com/soap/pugxml.asp 

• Matlab®www.mathworks . com/products/matlab 

• QHull www.qhull.org 

• GLEWglew.sourceforge.net 

• DeVILopenil.sourceforge.net 

• Cgdeveloper.nvidia.com 

• MovieMaker®(from NVSDK®) developer . nvidia. com 

• PATH from CPNET www. cs . wise . edu/epnet/epnetsof tware 
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As of this writing the OpenTissue-specific code parts contain the following dependencies on external 
libraries: 


OpenTissue/map/util/cscgl 

-a 

RenderTexture, openGL 

OpenTissue/map/util/voxelizer 


RenderTexture, openGL 

OpenTissue/map/CgUtil/* 

-a 

RenderTexture, Cg 

OpenTissue/math/algebra/* 

-a 

Atlas 

OpenTissue/math/lcp/dantzigBruteUblasAtlasAdapter 

-> 

Atlas 

OpenTissue/math/lcp/path.* 

—> 

PATH 

OpenTissue/math/lcp/m2cLemke 

-+ 

Matlab® 

OpenTissue/mesh/util/ConvexHull 


QHull 

0penTissue/tetra4/util/qsss 

-*■ 

Atlas 


If you do not need some or all of these parts of OpenTissue, you should just remove them from your 
project file. 


25.3.3 Sample Data 

In our DEMO applications we use an environment variable, DATATISSUE. This should point to the folder 
location, where the demo application will look for data files. Sample data can be downloaded from the 
OpenTissue Web page: 

• www.opentissue.org 

Or freely available volume data sets (CT-scans and more) can be downloaded from the Web pages: 

• www.osc.edu/~ jbryan/VolSuite/downloads.shtml 

• openqvis.sourceforge.net/index.html 

25.3.4 Guidelines for Developing in OpenTissue 

For those with write access to the OpenTissue Subversion ( SVN ) Repository, we kindly ask to follow some 
simple guidelines: 

• Every file should at least contain the Lesser GPL license header. 

• All headers should have the pragma once lines added immediately after the license header. 

• The included guarding should reflect the file structure and tile name of the header tile, as an example: 

OpenTissue/map/map. h uses OPENTISSUE_MAP_MAP_H. 

• Everything should be encapsulated into the OpenTissue namespace. 

• No two classes should have the same name, even if they arc located in different folders. 

• Please make your own local copy of project files and solution files so you do not overwrite the ones 
in the repository with your specific settings. If you use Makefiles, then please do not pollute the 
Makefiles in the repository, which will prevent others from building OpenTissue. 
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• Please try to follow our naming conventions and usage of uppercase and lowercase letters. 

• No data files should be added directly to OpenTissue. 

• Please separate your data structure from algorithms and input/output routines. A good example of 
this is the map part of OpenTissue. The actual map data structure is located in the folder 

$(OPENTISSUE)/OpenTissue/map/ 

Various classes for reading and writing different map file formats arc located in 
$(OPENTISSUE)/OpenTissue/map/io 
All algorithms working on the map data structure arc located in 

$(OPENTISSUE)/OpenTissue/map/util 

• Please always document your code as well as possible. We use Doxygen style (www. doxygen . 
org) throughout all our code. We stride toward documenting code on class level, method, and 
member level as well. For methods this should include full description of all arguments and return 
values. If part of an algorithm, the intended caller/callee relationship would be nice to have in the 
long description of the method. 
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Notation, Units, and Physical Constants 


With such a large and diverse field as physics-based animation, notation is doomed to be creatively and 
inconsistently used. A good attempt at a unified notation is found in [Weisstein, 2005b, Weisstein, 2005a], 
Below we summarize the notation used in this book. 

The international system of units and the fundamental physical and chemical constants form a ba¬ 
sis of the physics and chemistry as we know it. The following have been selected from [NIST, 2005, 
Kaye et al., 1995] for convenience. 
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C 

Z,M,C 

a,b,c,... ,a,P, 7 

lal 



Xl 


X2 

X = 



Xn 

11*11 , 

|A[ 

* 

ii n 2 

= v 

* = 

[1*1 


> 2 

A X i 


xy,xxy 

&xi i ^-z 

A — {djj}, -A — 

AB — {y^j. Uikbk'i I 

A' 1 

4 ■ A ■ 

A = [A*i | A*2 | • • • | J- 
det A = | A\ 

1 

a x = A 

— — d 
dx —y* 

x — dt 

r, v = r, a = f 
da, dx, dA 
, d T 

V = 


V. 

V/ 

SL 

<5/ 

A = V r 
div = V • 
curl = V x 


A set 

The set of integers, the real line, and the complex plane 
A scalars or scalar field, e.g. a 6 M or / 6 M 3 —> M 
The absolute value of a 


A vector or vector field e.g. a: £ M 3 or y : M 3 —> M 3 


A general norm 

The Euclidean vector norm 

The vector of absolute elements 

The dot and cross product between two vectors 

The unit vectors defining a coordinate axes 

A general matrix and its transpose 

The matrix product 

The inverse of a matrix 

The i’th row and j’th column of matrix A 

Matrix concatenation 

The determinant of matrix A 

The identity matrix, having 1 on the diagonal and 0 elsewhere 
A cross matrix, such that a x b = a x b = Ab 
The parti at differential operator w.r.t. coordinate axis e x 
The time derivative of x 

Typically the position, velocity, and acceleration 
The differential of a scalar, vector, and matrix 

The gradient operator 

The gradient operator restricted to the elements of x 

The functional derivative 

The variational derivative 

The Jacobian matrix 

The divergence operator 

The curl or rotation operator 


Table 26.1: Often-used notation 
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Name 

Domain 

Implementation 

Spatial Nabla 


^ ^[ x ->y-> z ] T 

d x 

dy 

pz_ 

(26.1) 

Gradient 

/ : l 3 -> K 

V/ : M 3 —> M 3 

v/ = 

fx 

fy 

Jz_ 

(26.2) 

Jacobian 

u = [u,v,w] T : M 3 —T M 3 
Vm : M 3 -»• M 3x3 

(Vu) T U X Uy u z 

Vti= (Vv) T = v x v y v z (26.3) 

L(Vw) T J [ w x Wy Wz_ 

Divergence 

u = [u,v,iu] T : M 3 —r M 3 
div u : M 3 —> M 

div u = V • u = u x + v y + w z (26.4) 

Rotation 

u = [u,v,w] 7 : M 3 -> M 3 
curl m:R 3 ->8 3 

curl u = V x u = 

Wy - V Z 

u z - w x 

Vx ^ y 

(26.5) 

Laplace 

/ : M 3 —> M 

A/ : M 3 -»• R 

A/ = V 2 / = V • V/ (26.6a) 

= fxx “1“ fyy H - fzz (26.6b) 

Laplace 

u = [u, v, w] 7 : M 3 —> M 3 
Au : M 3 —► M 3 

A u = V 2 m = (y T (Vm) T ) T (26.7a) 

^xx H - ^ yy ^zz 

= V X X Vyy + v zz (26.7b) 

^xx “1“ ^yy ^zz 


Table 26.2: Common differential operators used on vector and scalar fields in three-dimensional space. 
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English Case English Case English Case 


Name 

Lower 

Upper 

Name 

Lower 

Upper 

Name 

Lower 

Upper 

Alpha 

a 

A 

Iota 

L 

I 

Rho 

P 

P 

Beta 

P 

B 

Kappa 

K 

K 

Sigma 

a 

X 

Gamma 

7 

r 

Lambda 

A 

A 

Tau 

T 

T 

Delta 

5 

A 

Mu 

P 

M 

Upsilon 

V 

T 

Epsilon 

e 

E 

Nu 

V 

N 

Phi 

(t> 

$ 

Zeta 

c 

Z 

Xi 

i 


Chi 

X 

X 

Eta 

V 

H 

Omicron 

o 

O 

Psi 


T 

Theta 

e 

0 

Pi 

7r 

n 

Omega 

UJ 

n 


Table 26.3: The Greek alphabet. 


Unit 

Name 

Measures 

m 

Meter 

Length 

g 

Gram 

Mass 

s 

Second 

Time 

A 

Ampere 

Electrical Current 

K 

Kelvin 

Temperature 

mol 

Mole 

Molecules 

cd 

Candela 

Luminous Intensity 

Table 26.4: 

International System of Units (SI) 


Prefix 

Exponential value 

Modifier 

Giga 

To 5 

G 

Mega 

10 6 

M 

Kilo 

10 3 

k 

Hecto 

10 2 

- 

Deka 

10 1 

- 

Base 

10 ° 

- 

Deci 

10” 1 

d 

Centi 

10” 2 

c 

Milli 

10" 3 

m 

Micro 

10“ 6 

P 

Nano 

10-9 

n 


Table 26.5: Metric modifiers. 
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Unit 

Name 

Measures 

Hz = s -i 

Hertz 

Frequency 

N = mkgs -2 

Newton 

Force 

J = m 2 kg s -2 

Joule 

Energy 

Pa = kgm ^ 1 s ~ 2 

Pascal 

Pressure and stress 


Table 26.6: Some derived units. 


Symbol Description 

G = 6.6742 HO 11 m 3 kg ' 1 s~ 1 Newtonian constant of gravity 
R = 8.314472J mol -1 K _1 Molar gas constant 
c = 299792458m s -1 Speed of light in vacuum 

Table 26.7: Some fundamental constants. 


Density Shear viscosity Kinematic viscosity Young’s modulus Poisson’s ratio 

Material p /(kg m 3 ) rj /(Pa s) /v/(m 2 s ') GPa 


Aii- 

1.18 

1.82 x 10“ b 

1.54 x 10“ 5 

- 

- 

Bone 

200 

- 

- 

14 

0.43 

Cork 

180 

- 

- 

0.032 

0.25 

Glass 

2.5 x 10 3 

CM 

o 

T —1 

1 

OO 

O 

T—1 

10 15 -10 18 

65 

0.23 

Granite 

260 

- 

- 

66 

0.25 

Honey 

1.4 x 10 3 

14 

1 x 10~ 2 

- 

- 

Ice 

920 

- 

- 

9.1 

0.28 

Olive oil 

0.9 x 10 3 

6.7 x 10 -2 

7.4 x 10~ 5 

- 

- 

Steel 

780 

- 

- 

210 

0.28 

Water 

1.00 x 10 3 

1.00 x 10 -3 

1.00 x 10~ 6 

- 

- 

Blood 

1.06 x 10 3 

2.7 x 10 -3 

2.5 x 10“ 6 

- 

- 


Table 26.8: Some physical properties. 

Materials 

Static Frictions 

Kinematic Frictions 

Steel on Steel 

0.74 

0.57 

Rubber on Concrete 

1.0 

0.8 

Glass on Glass 

0.94 

0.4 

Ice on Ice 

0.1 

0.03 


Table 26.9: Some friction coefficients. 
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narrow-phase detection, 399, 433 
response, 106, 267, 333 
sequential, 152 

sequential resolving method, 125 
simultaneous, 152 
single-point, 125 
comparison function, 717 
complementarity condition, 153, 189 
complexity, 405 

compliant contact model collision law, 130 
compression 
phase, 128 

plane of maximum, 146 
computational 

mechanics, 260 
molecule, 709 
computer graphics model, 3 
cone-based spring creation method, 274 
conjugate 

directions, 625 
gradient, 329 
gradient descent, 624 
Gram-Schmidt process, 626 
conservation 
of mass, 380 

of mechanical energy, 672 
Conservative Advancement, 168 
conservative force, 333, 671 
consistent mass matrix, 356 
constraint 
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bilateral, 198 
hard, 295 
kinematic, 200 
scleronomous, 199 
constraint-based simulation, 106, 125 
contact 

colliding, 128 
condition, 189 

determination, 435, 437, 504 
force, 183, 668 

force equivalence principle, 152 
formulation, 442 
generation, 558 
normal, 125, 437, 668 
plane, 437, 668 
point, 91, 437 
principal, 437 
region, 441 
resting, 128 
separating, 128 
single point of, 667 
static, 125 
tracking, 447 
unilateral, 198 
velocity, 127 
contact forces, 332, 433 
continuity, 745 
continuity equation, 380 
continuous collision detection, 9, 516 
continuous index, 687 
continuum models, 305 
control points, 739 
convention 

contact normal, 184 
Einstein’s summation, 591 
tetrahedra mesh notation, 337 
convergence 
Fourier, 711 
Taylor, 710 

convex decomposition, 555 
convex polyhedra, 534 
coordinate 


barycentric, 337, 512, 531 
generalized, 683 
homogeneous, 46 
material, 305 
natural, 337 
sorting, 416 
world axes, 653 
coordinate system, 573 
left-handed, 579 
orthogonal, 573 
right-handed, 579 
world, 406, 460, 573, 649 
coriolis acceleration, 207 
coriolis force, 129 

Coulomb’s friction law, 120, 134, 193, 667 
coupled differential equations, 695 
covariance matrix, 475 
Cox de Boor definition, 727 
critical damping, 680 
cross product, 572 
matrix, 291, 581 
Clout’s algorithm, 609 
CULLIDE, 525 
curvature, 309 
mean, 309 
tensor, 309 
testing, 497 
cusps, 762 
cylinders, 405 
Cyrus-Beck clipping, 543 

D’Alembert’s principle, 683 
damped oscillator 
critically, 101 
harmonic, 94, 269, 678 
heavily, 101 
damping 

coefficient, 95, 355, 678 
coefficient of mass, 367 
coefficient of stiffness, 367 
constant, 298 
critical, 680 
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density, 306 

force, 95, 306, 355, 678 

matrix, 326, 692 

matrix element, 356 

over, 679 

Rayleigh, 367 

time, 102 

viscosity force, 269 
data acquisition and modeling, 2 
data points, 741 
de Boor algorithm, 762 
deformable objects, 259, 305, 470 
degenerate mesh, 276 
Denavit-Hartenberg, 57 
notation, 22 
density, 378 

damping, 306 
Lagrangian, 306 
of damping, 306 
of energy, 306 
of mass, 306 
derivative 

directional, 594 
functional, 597 
test, 546 
total, 587 

variational, 643, 688 
design matrix, 612 
determinant of a matrix, 578 
differential, 587, 613 
forms, 587 
geometry, 308 
volume, 306 

differential equation, 695 
coupled, 695 
ordinary, 118, 262, 695 
partial, 695 

quasi-linear second-order partial, 707 
second-order, 265 
second-order ordinary, 652, 676 
stiff ordinary, 701 
unstable, 316 


dimension reduction and value propagation, 665 
Dirac delta function, 394 
directional derivative, 594 
Dirichlet boundary conditions, 331 
discrete filtering kernels, 320 
displacement, 206, 335 
function, 341 
nodal, 340 
vector, 316 
virtual, 348 
diverge, 701 
divergence, 593 
divergence theorem, 593 
dot product, 572 
dry friction, 668 
dynamic 

algorithm, 413 
friction, 120, 134, 668 
dynamics, 3 

multibody, 87 

edge clipping, 541 
eigenvalue, 477, 582 
problem, 477 
eigenvector, 477, 582 
Einstein’s summation convention, 591 
elastic 

energy, 128, 305 
force, 305 

potential energy, 306 
element 

damping matrix, 356 
mass matrix, 356 
stiffness matrix, 349 
surface force, 350 
elliptic, 706 
end effector, 45 
energy 

conservation of mechanical, 672 
density, 306 
elastic, 128, 305 
function, 270, 293 
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kinematic, 128 
kinetic, 671 
kinetic contact, 139 
mechanical, 672 
potential, 293, 332, 671 
potential elastic, 306 
rotational kinetic, 675 
translational kinetic, 675 
engineering shear strain, 693 
equations of motion, 307, 647 
equilibrium, 109, 335, 682 
hydrostatic, 387 
position, 109 
shape, 308 
static, 340, 348 
system in, 87, 282 
error-correcting methods, 178 
Euler angles, 46, 653 
Euler scheme, 187, 194 
explicit first-order, 104 
implicit first-order, 106 
Euler’s 

equation, 379, 658 
explicit method, 266 
Euler-Lagrange equation, 262, 641 
exhaustive search, 413 
experimental mechanics, 260 
explicit 

Euler integration, 266, 697, 700 
first-order Euler scheme, 104 
integration, 299 
midpoint method, 700 
solution, 709 
exponential springs, 122 
external forces, 306 
External Voronoi Region, 536 

factor 

rigidity, 272 
FDM, 261 

feature-based algorithm, 434, 525 
FEM, 262, 335 


FFD, 288 
FIFO, 457 

finite difference, 305 
method, 261 
operators, 318 

finite element method, 262, 335 
finite volume method, 261 
first fundamental form, 308 
first-in-first-out queue, 457 
first-order world, 247 
fit-and-split strategy, 472 
fixed axis angles, 653 
fixed joints, 219 
fluid 

incompressible, 380 
inviscid, 381 
viscous, 381 
force 

balance equation, 350 
conservative, 333, 671 
contact, 183, 332, 433, 668 
Coriolis, 129 
damping, 306, 355, 678 
damping viscosity, 269 
elastic, 305 
element surface, 350 
external, 306 

friction, 120, 134, 193, 668 
frictional motor, 228 
generalized, 684 
gravity, 332 
inertia, 340, 355 
lineal - viscosity, 678 
N-ary, 270 
node, 340 

nonconservative, 672 
normal, 668 
penalty, 90, 332 
potential, 270 
pressure, 281 
spring, 332 
unary, 270 
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viscous, 332 

force-based formulations, 183 
form 

strong, 260 
variational, 261 
weak, 261 
forward 

dynamics, 3 
kinematics, 3, 45, 51 
four principles, 403 
Fourier convergence, 711 
frame 

body, 17, 126, 573, 653 
Frenet, 285 
inner, 17 
model, 460 
outer, 17 

free-form deformations, 288 
Frenet frame, 285 
frequency, 95, 371, 677 
natural, 95, 269, 676 
friction, 193, 381 
air, 269 

air coefficients, 270 
coefficient, 121, 668 
cone, 147, 193, 669 
Coulomb’s law, 120, 134, 193, 667 
dry, 668 

dynamic, 120, 121, 134, 668 
force, 120, 134, 193, 668 
impulse, 195 
motor force, 228 
pyramid, 193 
sliding, 134, 668 
stable static, 134, 668 
static, 120, 121, 134 
tangential constraint, 221 
torsional, 197 
unstable static, 134, 668 
full deformation collision law, 130 
Full-Coordinate methods, 207 
fully conservative coverage, 465 


functional derivative, 597 

Galerkin’s method, 261 
gas constant, 282 
Gauss elimination, 605 
Gauss’ theorem, 593 
Gauss’s theorem, 665 
Gauss-Jordan elimination, 605 
Gauss-Seidel method, 614 
Gaussian kernel, 392 
general assembly, 354 
general position, 338 
generalized 

coordinates, 683 
force, 684 
joint limits, 226 
joint parameters, 45 
orientation, 185 
position, 185 
velocity, 185 

geometric algorithms, 557 
geometrical shapes, 2 
geometry vectors, 764 
GJK, 445, 555 
global interpolation, 741 
global parameter, 746 
gradient, 592 
gradient descent, 621 
gravitational acceleration, 91 
gravity 

force, 332 
waves, 372 

Green’s strain tensor, 691 

half-life time, 104 
Hamilton’s principle, 686 
Hamiltonian formulation, 89 
hai'd constraints, 295 
harmonic oscillator, 89, 269, 675 
damped, 94, 269, 678 

hash 

function, 530 
map, 530 
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Hausdorff Distance, 467 
heap, 171 
heat, 128 

heat diffusion equation, 709 
Helmholtz-Hodge decomposition, 381 
heterogeneous bounding volume hierarchy, 463 
hierarchical decomposition, 529 
hierarchical hash tables, 421 
Hilbert-Schmidt norm, 311 
hinge joint, 209 
limit, 225 
hinge-2 joint, 215 
holonomic constraints, 198 
homogeneous bounding volume hierarchy, 463 
homogeneous coordinates, 46 
Hooke’s Spring Law, 94, 268, 315, 347 
Hooke’s spring law, 676 
hybrid bounding volume hierarchy, 463 
hydrostatic equilibrium, 387 
hyperbolic, 706 
hypothesis 

Poisson’s, 132 
Stronge’s, 133 

Ideal Gas Law, 282, 382 
ideal wave, 373 
identity matrix, 576 
image-based techniques, 557 
impact zone, 445 
implicit 

Euler integration, 700 
first-order Euler scheme, 106 
function, 293, 557 
integration, 299, 326, 701, 702 
scheme, 266, 327 
impuls-based simulation, 108 
impulse, 129, 183 
collision, 125, 129 
force equivalence principle, 152 
friction, 195 
plastic sliding, 139 
plastic sticking, 140 


propagating, 152 
space, 139 
time-integral, 125 
impulse-based simulator, 125 
Impulse-Momentum Law, 133 
impulse-momentum relation, 136 
incompressible fluid, 380 
incremental collision law, 125, 130 
Incremental methods, 480 
inertia forces, 340, 355 
inertia tensor, 661 
initial conditions, 269 
initial value problem, 262 
inner frame, 17 
insertion sort, 417 
integration, 131 

4th-order Runge-Kutta, 700 
analytical volume, 664 
by dimension reduction and value propaga¬ 
tion, 665 
by parts, 642 
explicit, 299 

explicit Euler, 266, 697, 700 
explicit midpoint, 700 
implicit, 299, 326, 701, 702 
implicit Euler, 700 
numerical, 79, 89, 118 
semi-implicit, 328, 703 
semi-Lagrangian time, 388 
time, 172, 366 
trapezoidal, 700 
Verlet, 267 

volume by decomposition, 664 
interference query, 506, 507 
internal pseudo Voronoi regions, 552 
International System, 378 
interpolation 
bilinear, 287 
global, 741 
splines, 72 
trilinear, 273 
intersection query, 435 
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interval of dependence, 719 
inverse 

dynamics, 3 
kinematics, 3, 45, 52 
matrix, 577 
inviscid 

fluid, 381 
gas, 384 

inward mesh extrusion, 275 
isotropic elastic stress tensor, 693 
isotropic elasticity matrix, 347 
isotropic materials, 340 
iterative traversal, 457 
IVP, 262 

Jacobi method, 614 
Jacobian 

matrix, 53, 297, 300, 592 
joint, 15 

axis, 206 

ball-in-socket, 199, 208 
fixed, 219 

generalized parameters, 45 

hinge, 209 

hinge-2, 215 

limits, 222 

motor, 227 

parameters, 45 

prismatic, 57 

revolute, 57, 209 

slider, 212 

universal, 218 

wheel, 215 

jointed mechanism, 87 

key-frames, 72 
kinematic 

constraint, 200 
viscosity, 378, 381 
kinematics, 3, 45 

forward, 45, 51, 71 
inverse, 45, 52 
principle, 413 


kinetic 

contact energy, 139 
energy, 128, 671 

knot 

insertion, 748 
insertion algorithm, 752 
value, 727 
vector, 288, 727 
Kronecker symbol, 190 

Lagrange 

equation, 686 
formulation, 682 
multipliers, 688 
Lagrangian, 686 

density, 306, 687 
formulation, 89, 307 
model, 307 
strain tensor, 691 
Lame coefficients, 693 
Landau symbol, 636 
Laplace 

equation, 716 
operator, 595 
law 

algebraic collision, 130 
collision, 130 

compliant contact model collision, 130 
Coulomb’s friction, 120, 134, 193, 667 
full deformation collision, 130 
Hooke’s spring, 94, 268, 315, 347, 676 
ideal gas, 282, 382 
impulse-momentum, 133 
incremental collision, 125, 130 
Newton’s first, 649 
Newton’s impact, 132 
Newton’s second, 265, 649 
Newton’s third, 649 
physical, 2 

layered depth image, 276, 559 
LCP, 190, 629 
LDI, 276, 559 


© 


© 




“book” — 2005/9/30 — 15:44 — page 811 — #823 


© 


a 


PHYSICS-BASED ANIMATION 


811 


left-handed coordinate system, 579 
level set equation, 724 
Lin-Canny, 533 
line search. 622 
linear 

advection, 721 
Cauchy strain matrix, 342 
Cauchy strain tensor, 358 
complementarity problem, 158, 190 
interpolation, 72 
least squares, 612 
set of equations, 605 
velocity, 127 
viscosity force, 678 
Lineal - Complementarity Problem, 629 
Linear Elastostatic Model, 340 
lineal' programming, 630 
link, 15 

length, 22 
offset, 22 
twist, 22 
link rotation, 22 
local parameter, 747 
local support, 745 
locality principle, 410 
LP, 630 
LU 

decomposition, 609 
factorization, 329 
lumped mass matrix, 357 

Maclaurin series, 635 
mass damping coefficient, 367 
mass-spring systems, 271 
material coordinates, 305 
mathematical 
model, 2 
programming, 7 
tools, 2 

mathematical programming, 630 
matrix, 575 

addition, 576 


covariance, 475 
cross product, 291, 581 
damping, 326, 692 
design, 612 
determinant, 578 
element damping, 356 
element mass, 356 
element stiffness, 349 
elements, 575 
field, 587 
function, 588 
identity, 576 
inverse, 577 
isotropic elasticity, 347 
Jacobian, 53, 297, 300, 592 
linear - Cauchy strain, 342 
lumped mass, 357 
multiplication, 576 
quaterion, 604 
stiffness, 320, 340, 358, 692 
trace, 577 
transpose, 577 
Vandermonde, 639 
maximum principle, 717 
maximum variance, 475 
mean curvature, 309 
mean value theorem, 635 
mechanical energy, 672 
conservation of, 672 
mechanics, 260 
applied, 260 
computational, 260 
experimental, 260 
theoretical, 260 
mesh 

cartoon, 288 
connectivity, 275 
coupling, 273, 288, 367 
degenerated, 276 
inward extrusion, 275 
surface, 274 
tetrahedal, 273 
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triangular, 299 
unstructured, 272 
unstructured solid, 273 
volume, 273 
method 

least squares, 261 
splitting, 711 
time-integration, 172 
metric tensor, 308, 316 
Microsoft Visual Studio C++ .NET 2003, 771 
midpoint method, 699 
minimal coordinate methods, 7 
minimum norm solution, 292 
model 

frame, 460 
relaxation, 310 
space, 460 
space update, 460 
modified tandem traversal, 496 
motion 

alignment, 13 
blending, 13 
capture, 13 

constraint matching, 13 
interpolation, 71 
programs, 13 
time-warping, 13 
MSVC, 771 
multibody, 87 
animation, 87 
dynamics, 87 
multilevel gridding, 421 
multiple queries, 402 
multiplicity, 745 
myopic view, 437 

N-ary forces, 270 

narrow-phase collision detection, 399, 433 
natural coordinates, 337 
natural frequency, 95, 269, 676 
Navier-Stokes equations, 381 
Newton’s 


Collision Law, 148 
first law, 649 
impact law, 132 
second law, 265,649 
third law, 649 

Newton-Euler equations, 89, 184, 647 
nodal displacements, 340 
node forces, 340 
non-holonomic constraints, 201 
nonconservative forces, 672 
nonlinear - anisotropic material properties, 7 
nonlinear - Green-Lagrange strain tensor, 359 
norm 

Hilbert-Schmidt, 311 
vector, 574 

weighted Frobenius, 311 
normal 

angle-weighted vertex, 274 
force, 668 

normalized basis function, 727 
nullspace, 611 

numerical integration, 79, 89, 118 

OBB, 405, 461 
object space algorithms, 557 
ODE, 262, 652, 695 
OpenTissue, 769 
optimal spatial hashing, 530 
order, 695, 727 

ordinary B-spline basis function, 749 
ordinary differential equation, 89, 118, 262, 695 
orientation generalized, 185 
oriented bounding box, 405, 461 
origin, 573 

orthogonal coordinate system, 573 
orthonormal basis, 573 
outer frame, 17 
over-constrained system, 611 
over-damping, 679 
over-relaxation, 260 

paired joint coordinates, 17 
pairwise tests, 455 
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parabolic, 706 

partial differential equation, 695, 703 
particle, 265, 647 

level set methods, 7 
system, 266, 270, 650 
trajectory, 284 
PC, 437 
PCS, 525 
PDE, 695, 703 
penalty 

force, 90, 332 
function, 293 
penetration, 128, 547 
penetration depth, 332 
period, 677 

permissible region in impulse space, 147 
Petrov-Galerkin’s method, 261 
phase, 677 

compression, 128 
restitution, 128 
space, 265 
physical laws, 2 
physical realism, 1 
physics-based animation, 1 
pitch, 46, 653 
pivot, 608 

plane of contact, 437 

plane of maximum compression, 146 

plastic sliding impulse, 139 

plastic sticking impulse, 140 

plausible simulation, 1 

point 

anchor, 206 
break, 741 
control, 739 
data, 741 
of action, 649 
point location, 401 
Poisson equation, 381 
Poisson’s hypothesis, 132 
Poisson’s ratio, 347, 693 
position 


general, 338 
generalized, 185 
of center of mass, 652 
potential 

elastic energy, 306 
energy, 671 
energy function, 332 
fields, 293 
force, 270 
function, 293 

potentially colliding set, 525 
predictions, 3 
pressure, 378 
force, 281 
soft bodies, 281 
principal contact, 437 
principle 

Animation, 1 
approximation, 405 
coherence, 412 

contact force equivalence, 152 
D’Alembert’s, 683 
Hamilton’s, 686 
impulse equivalence, 152 
kinematics, 413 
locality, 410 
maximum, 717 
of animation, 1, 13 
of virtual work, 689 
virtual work, 296 
Principles of Animation, 13 
prismatic joint, 15, 57 
projection error-correction, 178 
projection method, 333, 382 
propagating impulses, 152 
proximity query, 435, 506, 513 
pseudoinverse, 55, 292, 613 

QP, 630 

quadratic programming, 630 
quaternion, 600, 667 
matrix, 604 
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product, 600 
rotation, 601 
query 

interference, 506, 507 
intersection, 435 
proximity, 435, 506, 513 
recursive collision, 457 
visibility, 525 

radius of convergence, 636 
Raleigh-Ritz method, 262 
range searching. 401 
Rayleigh damping, 367 
recursive 

collision query, 457 
methods, 7 

search algorithms, 552 
tandem traversal, 457 
Reduced Coordinate methods, 207 
regular B-splines, 757 
regular spline, 285 
relative contact normal velocity, 132 
relaxation, 295 
model, 310 
over, 260 

successive over, 617 
resonance effects, 108 
rest shape, 310 
resting contact, 128 
restitution 

coefficient, 128, 333 
phase, 128 

retroactive detection approach, 229 
retroactive time control, 235 
revolute joint, 15, 57, 209 
right-handed coordinate system, 579 
rigid body assumptions, 129 
rigid body simulation, 87 
rigidity factor, 272 
rigidity tensor, 317 
roll, 46, 653 
rolling contact, 195 


rotation, 593 

quaternion, 601 
theorem, 593 

rotational kinetic energy, 675 
Runge-Kutta method, 699 

sample-based coverage, 465 
scalar, 571 
field, 587 
scheme 

Euler, 187 

implicit integration, 299, 326 
semi-implicit integration, 328 
semi-Lagrangian time integration, 388 
time-stepping, 173 
scientific computing, 5 
scleronomous constraint, 199 
scripted 
body, 76 
motion, 76 

second fundamental form, 309 
second-order differential equation, 265 
ordinary, 652, 676 

second-order partial differential equations 
quasi-linear, 707 
secondary oscillations, 108 
self-intersections, 434, 494 
semi-implicit integration, 328, 703 
semi-Lagrangian time integration, 388 
separating contact, 128, 195 
sequential collision resolving method, 125 
sequential collisions, 152 
SF, 260 

shallow water equations, 387 
shape functions, 342 
shear viscosity, 378, 381 
shearing diagonals, 273 
shearing spring, 272 
shock propagation, 176, 268 
SI, 378 

signed distance map, 534, 557 
simplex methods, 9 


© 


© 



“book” — 2005/9/30 — 15:44 — page 815 — #827 


© 


a 


PHYSICS-BASED ANIMATION 


815 


simplex-based algorithm, 434 
simulation 

loop, 91, 399 
method, 399 
rigid body, 87 
simulator 

constraint-based, 125 
impulse-based, 108, 125 
velocity-based, 125 
simultaneous collisions, 152 
single point of contact, 667 
single traversal, 496 
single-point collision, 125 
singular value decomposition, 292, 611 
slider joint, 212 
slider joint limit, 222 
sliding contact, 195 
sliding friction, 134, 668 
smoothing kernel, 392 
SOR, 617 
space product, 573 
spatial 

data structures, 434 
diagonals, 273 
hashing, 368 
subdivision, 434, 451 
spatial-temporal coherence analysis, 125 
spheres, 405, 461 
splatter method, 478 
splitting method, 473, 711 
splitting plane, 473 
spring, 268, 676 
angular, 279 
bending, 272 
coefficient, 94, 676 
constant, 298 
creation method, 272 
exponential, 122 
force, 332 
shearing, 272 
structural, 271 
surface, 272 


volume preserving, 274 
spring-damper system, 89 
stable static friction, 134, 668 
stack layers, 177, 178 
state 

function, 667 
variable, 59 
vector, 46, 51, 296 
static 

contacts, 125 
equilibrium, 340, 348 
friction, 120, 134 
stencils, 320 
step doubling, 700 

stiff ordinary differential equations, 701 
stiffness 

damping coefficients, 367 
matrix, 320, 340, 358, 692 
warping, 359 
Stokes' theorem, 593, 665 
strain, 335, 340, 342 

engineering shear, 693 
rate, 692 
strain tensor 
Cauchy, 692 
Green’s, 691 
lineal - Cauchy, 358 
nonlinear Green-Lagrange, 359 
stress, 335, 340, 346 

2nd Piola-Kirchoff, 692 
viscous, 340 
stress tensor 

isotropic elastic, 693 
strong form, 260 
Stronge’s hypothesis, 133 
structural spring, 271 
subdivision, 473 
subdomain method, 261 
Subversion, 770 
successive over relaxation, 617 
surface meshes, 274 
surface springs, 272 
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SVD, 292, 611 
SVN, 770, 774 
Sweep and Prune, 416 
sweep line, 416 
sweeping volumes, 427 
SWIFT, 533 

systems in equilibrium, 87, 282 

tandem traversal, 451 
tangential contact velocity, 139 
tangential friction constraints, 221 
Taylor 

approximation, 266 
convergence, 710 
expansion, 53 
series, 266, 589, 635 
Taylor’s theorem, 636 
temperature, 378 
tensor 

B-spline, 288 
curvature, 309 
metric, 308, 316 
rigidity, 317 
tetrahedral mesh, 273 
tetrahedron, 335 
theorem 

Gauss’, 593 
Stokes’, 593, 665 
Taylor’s, 636 

theoretical mechanics, 260 
Three-Dimensional Linear Elastostatics Model, 
340 

tightness of a bounding volume, 467 
time 

integral impulse, 125 
integration, 172, 366 
of impact, 166, 413, 433 
of impact heap, 171 
stepping method, 166, 173, 229 
time-critical tandem traversal, 458 
TOI, 166, 413, 433 
top-down method, 472 


topological errors, 276 
torque, 658 
torsion, 309 
torsional friction, 197 
total derivative, 379, 587 
trace of a matrix, 577 
trajectory, 647, 697 
transfer-of-axe theorem, 665 
translational kinetic energy, 675 
transpose matrix, 577 
trapezoidal integration, 700 
traversal 

breadth-first, 179, 273 
iterative, 457 
modified tandem, 496 
recursive tandem, 457 
single, 496 
tandem, 451 

time-critical tandem, 458 
triangular mesh, 299 
trili near interpolation, 273 
truncation error, 709 

unary forces, 270 
unconditionally consistent, 710 
unconditionally stable, 106 
unconstrained motion, 667 
unilateral contacts, 198 
universal joint, 218 
unstable, 701 

unstable static friction, 134, 668 
unstructured meshes, 272 
unstructured solid meshes, 273 
upwind scheme, 723 

V-Clip, 445, 533 
Vandermonde matrix, 639 
variational derivative, 311, 643, 688 
variational form, 261 
vector, 571 

addition, 571 
field, 587 
geometry, 764 
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knot, 288, 727 
length, 571 
norm, 574 

of displacements, 316 
product, 572 
state, 46, 51, 296 
velocity, 378, 647 
contact, 127 
generalized, 185 
lineal - , 127 

relative contact normal, 132 
tangential contact, 139 
velocity-based formulation, 183 
velocity-based simulators, 125 
Verlet integration, 267 
VF, 261 

virtual displacement, 348, 682 
virtual work, 682 
viscosity, 270 

coefficient, 95, 678 
kinematic, 378, 381 
shear, 378, 381 
viscous 

fluid, 381 
force, 332 
stress, 340 

visibility queries, 525 

Vol, 559 

volume 

integration, 664 

integration by decomposition, 664 
intersection of, 559 
mesh, 273 

preserving springs, 274 
volume-based algorithms, 434 
volumetric representation, 557 
von Neumann boundary conditions, 329 
Voronoi 

diagram, 536 

internal pseudo regions, 552 
plane, 536 
regions, 447, 533 


Voronoi-Clip, 533 
voxelization, 273 

wave 

breaking, 377 
equation, 719 
gravity, 372 
ideal, 373 

WCS, 406, 460, 573, 649 

weak form, 261 

weighted Frobenius norm, 311 

weighted residual method, 261 

WF, 261 

wheel joint, 215 

work, 671 

balance equation, 348 
principle of virtual, 296 
virtual, 682 

work-energy theorem, 671 
world coordinate axes, 653 
world coordinate system, 406, 460, 573, 649 
world space update, 460 

yaw, 46, 653 

Young’s modulus, 347, 693 
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